June  1991 


AD-A254  553 

■IIIHIII 


•V  Report  No.  STAN-CS-91-1366 


Combinatorial  Algorithms  for  Optimization  Problems 


i 


* 


by 


Edith  Cohen 


DTIC 

ELECTE  " 
AUG  2 1 1992 

A 


Department  of  Computer  Science 

Stanford  University 
Stanford,  California  94305 


This  document  has  been  approved 
foi  public  release  and  sale;  its 
dictiibution  is  unlimited. _ _ 


92-23193 

■nnnm  m 

!9~o 


y2  b  19 


REPORT  DOCUMENTATION  PACE 


^cSSE^^SISS^S 

HK!i:vn!raCT.T  ^>!J};.LiI.mMMiNi,y>i.7TrCi.,:Jl^ 


omn%.97o*-»m 


IRIIUTION  /  AVAHAI 


UNt-lrtlTEO 


Abstract.  Linear  programming  is  a  very  general  and  widely  used  framework.  In  this  thesis  we  consider 
several  combinatorial  optimisation  problems  that  can  be  viewed  as  classes  of  linear  programming  problems 
with  special  structure.  It  is  known  that  polynomial  time  algorithms  exist  for  the  general  linear  programming 
problem.  It  is  not  known,  however,  whether  any  of  them  are  strongly  polynomial  In  addition,  it  seems  that 
the  general  problem  is  inherently  sequential.  For  problems  with  special  structure,  our  goals  are  to  develop 
sequential  and  parallel  algorithms  that  are  faster  than  those  known  for  general  linear  programming  and  to 
determine  whether  strongly  polynomial  algorithms  exist,  (i)  We  develop  a  technique  that  extends  the  classes 
of  problems  known  to  have  strongly  polynomial  algorithms,  or  known  to  be  quickly  solvable  in  parallel.  This 
technique  is  used  to  obtain  a  fast  parallel  algorithm  and  a  strongly  polynomial  algorithm  for  detecting 
cycles  in  periodic  graphs  of  fixed  dimension.  We  mention  additional  applications  to  parametric  extensions  of 
problems  where  the  number  of  parameters  is  fixed,  (ii)  We  introduce  algorithms  for  solving  linear  systems 
where  each  inequality  involves  at  most  two  variables.  These  algorithms  improve  over  the  sequential  and 
parallel  running  times  of  previous  algorithms.  These  results  are  combined  with  additional  ideas  to  yield 
faster  algorithms  for  some  generalised  network  flow  problems. 
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Abstract 


Linear  programming  is  a  very  general  and  widely  used  framework.  In  this  thesis  we 
consider  several  combinatorial  optimization  problems  that  can  be  viewed  as  classes 
of  linear  programming  problems  with  special  structure.  It  is  known  that  polynomial 
time  algorithms  exist  for  the  general  linear  programming  problem.  It  is  not  known, 
however,  whether  any  of  them  are  strongly  polynomial  (informally,  a  polynomial  time 
algorithm  is  strongly  polynomial  if  the  number  of  arithmetic  operations  performed  is 
bounded  by  a  polynomial  function  of  the  number  of  variables  and  inequalities,  i.e.,  is 
independent  of  the  size  of  the  numbers).  In  addition,  it  seems  that  the  general  problem 
is  inherently  sequential  (fast  parallel  algorithms  cannot  be  obtained).  For  problems 
with  special  structure,  our  goals  are  to  develop  sequential  and  parallel  algorithms 
that  are  faster  than  those  known  for  general  linear  programming  and  to  determine 
whether  strongly  polynomial  algorithms  exist. 

We  develop  a  technique  that  extends  the  classes  of  problems  known  to  have 
strongly  polynomial  algorithms,  or  known  to  be  quickly  solvable  in  parallel.  This  tech¬ 
nique  is  used  to  obtain  a  fast  parallel  algorithm  and  a  strongly  polynomial  algorithm 
for  detecting  cycles  in  periodic  graphs  of  fixed  dimension.  We  mention  additional 
applications  to  parametric  extensions  of  problems  where  the  number  of  parameters  is 
fixed. 

We  introduce  algorithms  for  solving  linear  systems  where  each  inequality  involves 
at  most  two  variables.  These  algorithms  improve  over  the  sequential  and  parallel 
running  times  of  previous  algorithms.  These  results  are  combined  with  additional 
ideas  to  yield  faster  algorithms  for  some  generalized  network  flow  problems. 
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Chapter  1 


Introduction 


1.1  Background 


Linear  programming  (LP)  is  a  very  widely  studied  and  commonly  used  class  of  opti¬ 
mization  problems  that  encompasses  many  combinatorial  optimization  problems.  A 
linear  programming  problem  of  n  variables  and  m  inequalities  consists  of  a  matrix 
A  €  iP"*"  and  two  vectors  6  €  iP",  c  6  iP*.  The  goal  is  to  find  a  vector  *  €  iP* 
(an  assignment  of  values  to  the  variables)  such  that  Ax  <b(x  satisfies  the  inequal¬ 
ities)  and  cT x  (the  value  of  the  objective  function)  is  maximized.  The  algorithms 
that  are  most  commonly  used  in  practice  are  variants  of  the  simplex  method,  due  to 
Dantzig  [16].  None  of  these  variants,  however,  is  known  to  run  in  polynomial  time 
on  all  instances.  The  existence  of  a  polynomial-time  algorithm  for  the  general  LP 
problem  was  in  doubt  until  Khachiyan  [40]  obtained  such  an  algorithm  by  modifying 
the  “ellipsoid  method,”  a  tool  used  in  nonlinear  optimization.  A  few  years  later,  Kar- 
markar  [37]  introduced  “interior  point  methods,”  which  hold  the  promise  of  yielding 
LP  algorithms  that  are  both  provably  polynomial  and  efficient  in  practice.  Many 
others  followed  (see  e.g.  [36,  54,  60]).  Currently,  the  best  worst-case  upper  bound  for 
the  problem  is  due  to  Vaidya  [60]. 
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Strongly  polynomial  algorithms:  Vaidvas  bound,  as  well  as  all  known  bounds 
on  the  running  times  of  general  LP  algorithms,  depends  not  only  on  n  and  m  but  also 
on  the  size  of  the  entries.  An  algorithm  for  a  subset  of  LP  problems  is  strongly  poly¬ 
nomial  (SP)  if  (i)  the  number  of  arithmetic  operations  is  bounded  by  a  polynomial 
in  m  and  n  and  (ii)  the  algorithm  does  not  generate  numbers  whose  size  (binary  rep¬ 
resentation)  is  larger  than  some  polynomial  function  of  the  input  size.  An  important 
open  problem  is  whether  general  LP  has  an  SP  algorithm. 

Asymptotic  notation:  We  use  the  standard  asymptotic  notation  to  measure  and 
compare  the  resource  use  (running  time,  space,  processors)  of  algorithms.  Suppose  g 
is  a  monotone  increasing  function  into  the  positive  reals  and  /  is  a  positive  function 
defined  on  the  same  domain  as  p.  f  —  0(g)  (resp.,  /  =  o(g),  f  =  D(g),  f  =  u>(p), 
/  =  6(g))  if  3c  >  0  such  that  lim/ / g  <  c  (resp.,  lim f/g  =  0,  f  /g  >  c  infinitely 
often,  lim  fig  =  oo,  /  =  0(g)  and  /  =  D(g)).  We  also  use  the  “soft  bound”  notation 
0(f)  =  0(f  polylog  /).  The  common  notation  for  the  class  of  problems  solvable  in 
polynomial  time  is  V. 

Randomized  Algorithms:  Randomized  algorithms  make  decisions  according  to 
the  outcomes  of  flipping  fair  coins.  Two  types  of  randomized  algorithms  are  men¬ 
tioned  in  the  literature,  Monte  Carlo  algorithms  and  Las  Vegas  algorithms.  An 
asymptotic  running  time  of  0(f)  is  interpreted  as  follows:  (i)  Deterministic  algo¬ 
rithms  are  guaranteed  to  terminate  in  0(f)  time,  (ii)  Monte  Carlo  algorithms  are 
guaranteed  to  terminate  in  0(f)  time  and  give  an  answer  that  is  correct  with  some 
constant  probability  p  >  2/3.  (iii)  Las  Vegas  algorithms  terminate  in  0(f)  expected 
time  and  are  guaranteed  to  return  a  correct  answer.  The  randomized  algorithms 
discussed  in  this  thesis  are  Las  Vegas  type. 

Parallel  Algorithms:  Within  the  class  V,  we  want  to  identify  problems  for  which 
we  can  benefit  by  using  a  parallel  computer,  where  many  processors  are  available  and 
able  to  work  concurrently.  The  PRAM  (parallel  random  access  machine)  [24]  is  the 
abstract  parallel  machine  model  that  takes  the  role  played  by  the  Turing  machine 
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or  RAM  in  sequential  computation.  The  PRAM  is  a  shared  memory  multiprocessor 
machine.  Several  types  of  PRAM’S  are  considered  in  the  literature,  according  to 
whether  they  allow  concurrent  read  and  write  operations  from  or  to  the  same  memory 
location.  The  CREW  model,  for  example,  allows  concurrent  reads  but  only  exclusive 
writes.  The  CRCW  model  allows  both  concurrent  reads  and  concurrent  writes.  A 
parallel  implementation  of  a  particular  algorithm  has  optimal  speedup  if  the  product  of 
processors  and  time  is  of  the  order  of  the  sequential  running  time.  A  complexity  class 
that  somewhat  captures  the  notion  of  efficient  parallel  computation  is  A fC.  The  class 
A fC  was  introduced  by  Pippenger  [53]  and  is  robust  in  the  sense  that  it  applies  to  many 
parallel  machine  models.  A  problem  is  in  AfC  if  it  can  be  solved  in  polylogarithmic 
time  using  a  polynomial  number  of  processors.  Obviously,  AfC  C  V.  A  fundamental 
open  problem  is  whether  V  =  AfC.  A  problem  is  P-complete  if  the  existence  of  an  AfC 
algorithm  for  it  implies  V  =  AfC,  or  equivalently,  if  every  problem  in  V  can  be  reduced 
to  it  using  logarithmic  space.  'P-complete  problems  are  the  “hardest”  problems  in 
P.  General  LP  was  shown  to  be  P-complete  by  Dobkin,  Lipton,  and  Rice  [18].  The 
common  belief  is  that  P  AfC.  Hence,  P-complete  problems  in  general,  and  LP  in 
particular,  are  viewed  to  be  inherently  sequential. 


Examples  of  classes  of  LP  problems  with  special  structure:  When  a  class  of 
LP  problems  with  special  structure  is  considered,  one  might  try  to  find  either  an  SP 
(resp.,  AfC)  algorithm  for  this  class  or  an  SP  (resp.,  logspace)  reduction  of  general  LP 
problems  to  problems  of  this  class  (see  [10]).  Such  a  reduction  asserts  that  proving 
that  this  class  is  SP  (resp.,  in  AfC)  is  as  hard  as  proving  the  same  for  general  LP.  In 
the  thesis  we  consider  LP  problems  of  special  structure.  We  Me  concerned  both  with 
the  quantitative  problem  of  improving  the  parallel  and  sequential  time  bounds  and 
with  the  qualitative  question  of  strong  polynomiality.  We  give  examples  of  classes  of 
LP  problems  that  are  “easier”  than  general  LP: 


•  The  maximum  flow  problem  has  an  SP  algorithm  due  to  Edmonds  and  Karp  [21] 
and  Dinic  [17]  (better  bounds  are  given  in  [28,  52]). 
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•  The  more  general  minimum-cost  circulation  problem  was  shown  by  Tardos  [58] 
to  have  an  SP  algorithm  (see,  e.g.,  [2,  27]  for  better  bounds). 

•  Tardos  [59]  generalized  her  min-cost  circulation  SP  algorithm  to  LP  instances 
where  the  entries  in  the  matrix  A  are  bounded  by  a  polynomial  in  m  —  n  (b 
and  c  can  still  be  general). 

•  Megiddo  [47]  gave  a  linear  time  algorithm  for  LP  problems  with  a  fixed  number 
of  variables. 

•  Megiddo  [46]  gave  an  SP  algorithm  for  solving  linear  systems  with  at  most  two 
variables  per  inequality.  Faster  sequential  and  parallel  algorithms  are  presented 
in  Chapter  5. 

Generalized  circulation  (see  [42])  is  an  interesting  network  flow  problem.  It  is  not 
known  to  have  an  SP  algorithm  and  there  is  no  known  SP  reduction  of  general  LP 
problems  to  it.  We  present  some  partial  results  in  Chapter  6. 

1.2  Outline  of  the  thesis 

The  results  included  in  the  thesis  are  joint  work  with  Nimrod  Megiddo.  The  thesis 
contains  two  disjoint  sets  of  results.  The  first  set  consists  of  Chapters  2,  3,  and  4 
(an  extended  abstract  appeared  in  [6],  see  also  [7,  9]).  The  second  set  consists  of 
Chapters  5  and  6  (an  extended  abstract  appeared  in  [11]).  Chapter  7  contains  a 
conclusion.  Each  chapter  is  more  or  less  independent  of  the  other  chapters. 

In  Chapter  2  we  present  an  algorithm  to  detect  the  existence  of  directed  cycles  in 
periodic  graphs.  A  d-dimensional  periodic  graph  is  an  infinite  digraph,  where  isomor¬ 
phic  finite  sets  of  vertices  are  associated  with  the  points  of  the  d-dimensional  grid. 
Periodic  graphs  have  a  very  regular  structure;  the  edges  axe  such  that  the  periodic 
graph  “looks  the  same”  from  any  grid  point.  A  periodic  graph  can  be  represented 
by  a  finite  directed  graph  where  d-dimensional  integer  vectors  are  associated  with 
the  edges.  When  resolving  graph  properties  of  a  periodic  graph,  we  would  like  to 
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find  algorithms  whose  running  time  is  polynomial  in  the  size  of  the  directed  graph 
representing  it.  Periodic  graphs  in  general,  and  the  cycle  detection  problem  in  partic¬ 
ular,  were  studied  in  previous  papers  [34,  39,  41,  51].  Previous  algorithms,  however, 
solved  the  problem  by  reducing  it  to  polynomially  many  LP  problems  (see  [39,  41]). 
These  results  left  open  the  existence  of  strongly  polynomial  or  AfC  algorithms  for  the 
problem.  The  algorithm  presented  in  Chapter  2  is  a  strongly  polynomial  and  A^C 
algorithm,  when  the  dimension  d  is  fixed.  To  complement  the  result  we  also  show 
that  when  d  is  part  of  the  input,  the  existence  of  a  strongly  polynomial  time  or  AfC 
algorithm  for  the  problem  implies  the  existence  of  such  an  algorithm  for  general  LP 
problems.  We  also  show  how  the  same  algorithm  can  be  applied  to  compute  strongly 
connected  components  of  periodic  graphs  and  to  schedule  the  computation  of  systems 
of  uniform  recurrences. 

The  cycle  detection  algorithm  is  based  on  reducing  the  pi~blem  to  solving  in¬ 
stances  of  the  parametric  minimum  cycle  problem  with  d  parameters.  The  latter 
problem,  which  is  interesting  in  its  own  right,  can  be  solved  by  a  sequence  of  LP 
problems.  In  Chapter  3  we  present  a  method  that  allows  us  to  perform  the  com¬ 
putation  in  SP  time  and  in  AfC.  The  parametric  minimum  cycle  problem  is  defined 
as  follows.  Consider  a  digraph  where  weights  are  associated  with  the  edges.  The 
weights  are  linear  functions  of  d  variables  (“parameters”).  Each  set  of  values  for  the 
parameters  corresponds  to  a  set  of  scalar  weights  associated  with  the  edges  of  the 
graph.  The  goal  is,  roughly,  to  find  the  set  of  values  for  which  the  weight  of  the 
minimum-weight  cycle  is  maximized. 

The  purpose  of  Chapter  4  is  to  present  the  algorithm  of  Chapter  3  as  a  general 
method  to  achieve  strongly  polynomial  bounds.  The  scheme  used  to  maximize  the 
function  that  maps  sets  of  parameter  values  to  the  value  of  the  minimum-weight 
cycle  can  actually  be  applied  to  minimize  (resp.,  maximize)  large  family  of  convex 
(resp.,  concave)  functions.  The  minimization  requires  a  number  of  operations  that  is 
polynomial  in  the  number  of  operations  needed  to  evaluate  the  function  (when  the 
dimension  of  the  domain  is  fixed).  In  Chapter  4  we  omit  many  of  the  details  specific 
to  the  parametric  minimum  cycle  problem.  To  allow  for  independent  reading  many 
definitions  and  statements  are  repeated.  For  some  of  the  proofs,  however,  the  reader 
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is  referred  to  the  appropriate  place  in  Chapter  3. 

In  Chapter  5  we  present  faster  algorithms  to  solve  linear  systems  of  inequalities 
where  at  most  two  variables  appear  in  each  inequality  (TVPI  systems).  We  give  a 
deterministic  0(mn2)  time  algorithm  and  a  randomized  0(ns  +  mn )  expected  time 
algorithm,  where  m  is  the  number  of  inequalities  and  n  is  the  number  of  variables. 
In  parallel,  these  algorithms  run  in  0{n)  time  with  optimal  speedup.  The  previously 
best  known  algorithm  due  to  Megiddo  runs  in  0(mn3  log  m)  time  sequentially,  and 
0(n3logm)  time  with  optimal  speedup  in  parallel  [46]. 

Chapter  6  is  concerned  with  generalized  network  flow  problems.  In  a  generalized 
network,  each  edge  e  =  ( u,v )  has  a  positive  “flow  multiplier”  ae  associated  with  it. 
The  interpretation  is  that  if  a  flow  of  xe  enters  the  edge  at  node  u,  then  a  flow  of  aexe 
exits  the  edge  at  v.  We  present  algorithms  for  generalized  network  flow  problems  that 
utilize  the  results  of  Chapter  5. 

The  uncapacitated  generalized  transshipment  problem  (UGT)  is  defined  on  a  gen¬ 
eralized  network  where  demands  and  supplies  (real  numbers)  are  associated  with  the 
vertices  and  costs  (real  numbers)  are  associated  with  the  edges.  The  goal  is  to  find 
a  flow  such  that  the  excess  or  deficit  at  each  vertex  equals  the  desired  value  of  the 
supply  or  demand,  and  the  sum  over  the  edges  of  the  product  of  the  cost  and  the  flow 
is  minimized.  Adler  and  Cosares  [1]  reduced  the  restricted  uncapacitated  generalized 
transshipment  problem,  where  only  demand  nodes  are  present,  to  solving  a  single 
TVPI  system.  Therefore,  the  algorithms  of  Chapter  5.1.1  result  in  a  faster  algorithm 
for  restricted  UGT. 

Generalized  circulation  is  defined  on  a  generalized  network  with  demands  at  the 
nodes  and  capacity  constraints  on  the  edges  (i.e.,  upper  bounds  on  the  amount  of 
flow).  The  goal  is  to  find  a  flow  such  that  the  flow  excesses  at  the  nodes  are  propor¬ 
tional  to  the  demands  and  maximized.  We  present  a  new  algorithm  that  solves  the 
capacitated  generalized  flow  problem  by  iteratively  solving  instances  of  UGT.  The 
algorithm  can  be  used  to  find  an  optimal  flow  or  an  approximation.  When  used  to 
find  a  constant  factor  approximation,  the  algorithm  yields  a  bound  that  is  not  only 
more  efficient  than  previous  algorithms  but  also  SP.  This  is  the  first  SP  approximation 
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algorithm  for  generalized  circulation;  the  existence  of  this  approximation  algorithm 
is  particularly  interesting  since  it  is  not  known  whether  the  problem  has  an  SP  algo¬ 
rithm. 

1.3  Notation 

We  use  boldface  notation  for  vectors  and  matrices.  Suppose  U  is  a  matrix  and  d  is 
a  column  vector,  denote  by: 
di  -  the  i’th  entry  in  the  vector  d, 

Uij  -  the  ij  entry  in  the  matrix  U, 

U ^  -  the  i’th  row  of  the  matrix  U, 

U,j  -  the  j’th  column  of  the  matrix  U, 

UT  -  the  transposed  matrix  (Vi,  j, Uij  =  Uj{),  and 
dT  -  the  corresponding  row  vector. 

We  use  the  following  special  vectors: 
e  is  the  vector  with  all  entries  1, 

0  is  the  vector  will  all  entries  0,  and 
e‘  is  such  that  e\  =  1,  and  for  j  ^  i,  e‘-  =  0. 

Suppose  Ai  (1  <  i  <  fc)  are  sets  of  dements.  Denote  by: 

| i4*|  the  number  of  dements  in  the  set  Ai. 

Ai  x  Aj  the  cross  product  of  Ai  and  Aj,  that  is,  the  set  of  ordered  pairs 
(dj,  a.j)  where  a<  €  Ai,  aj  €  Aj. 

Xi <i<k  Ai  the  cross  product  of  the  sets  A\, . . . ,  A*,  that  is,  the  set  of  fc-tuples 
(aa, . . . ,  a*)  where  a,-  €  A»  (1  <  i  <  k). 

Aj  the  set  of  all  /-tuples  of  dements  of  A,-. 

AJ"xn  the  set  of  all  m  x  n  matrices  whose  entries  are  dements  of  Aj. 

We  use  the  notation  R,  J2+,  Q,  Z,  and  N,  for  the  sets  of  real  numbers,  positive 
real  numbers,  rational  numbers,  integers,  and  natural  numbers,  respectivdy.  Hence, 
Rk  denotes  the  fc-dimensional  Euclidean  space, 

Qk  denotes  the  fc-dimensional  linear  space  over  the  rationals,  and 
Zk  denotes  the  k-dimensional  integer  grid. 
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Consider  two  sets  F  C  Rn  and  F'  C  RF . 

F  —  F'  =  {y  £  Rn \y  =  x  —  x' ,  where  x  £  F,x'  £  F'}  is  the  sum  of  F  and  F'. 
When  every  y  £  F  +  F'  can  be  uniquely  represented  as  y  =  x  -f  x'  where  x  £  F, 
x'  £  F',  we  refer  to  F  ©  F1  =  F  4-  F'  as  the  direct  sum  of  F  and  F'. 

Denote  the  projection  of  F  on  the  coordinates  J  C  {1, . .  • ,n}  by  Fj  C  R^-  Also, 

•  F  is  convex  iff  for  every  x  £  F,  y  £  F  and  0  <  a  <  1,  we  have  ax-h(l  —  a)y  £  F. 

•  F  is  a  linear  subspace  iff  F  =  {a:  €  RF\Ax  =  0}  for  some  matrix  A  £  FT nxn. 

•  For  a  linear  subspace  F,  denote  the  orthogonal  complement  of  F  by  F1-  =  {x  £ 
RF\Vy  £  F,xTy  =  0}. 

•  F  is  a  flat  iff  F  =  {a:  €  RF\Ax  =  6}  for  some  matrix  A  £  fTnxn  and  vector 
6  €  RF1.  The  subspace  parallel  to  the  flat  F  is  {x\Ax  =  0}. 

•  F  is  a  cone  iff  for  all  x,y  £  F  and  a  >  0,  >  0,  ax  +  fly  £  F.  A  cone  is 

pointed  iff  it  does  not  contain  a  linear  subspace. 

•  The  lineality  space  of  a  cone  F  is  the  largest  subspace  L  such  that  for  all  *  €  F, 
x  +  L  C  F. 

We  use  the  notation: 

aff  F  for  the  affine  hull  of  F,  that  is,  the  smallest  flat  which  contains  F, 

lin  F  for  the  subspace  spanned  by  F, 

interior  F  for  the  interior  of  F,  and 

relint  F  for  the  relative  interior  (interior  relative  to  aff  F). 

We  denote  by  g  :  A  — »  B  a  function  from  a  domain  A  into  B.  For  a  subset 
H  C  A  of  the  domain,  denote  by  g\H  the  restriction  of  g  to  H,  that  is,  the  function 
g\H  :  H  —*  B  such  that  Va  €  H,g(a)  =  g\H(a). 

A  function  g  :  RF  —>  R  is  convex  (resp.,  concave)  if  for  all  x  €  RF,  y  £  RF,  and 
0  <  a  <  1,  g(ax  +  (1  -  a)y)  <  ap(x)  +  (1  -  a)p(y)  (resp.,  g(ax  +  (1  -  a)y)  > 
ag(x)  +  (1  -  a)g(y)). 
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By  G  =  (V,  E)  we  denote  a  directed  graph,  where  V  is  the  set  of  vertices  and  E 
is  the  set  of  edges.  We  use  the  convention  il’j  =  n.  j E  =  m.  By  G  =  ( Y.E.  f )  we 
denote  a  directed  graph  with  a  “weight’-  function  /  :  E  which  associates  weights  with 
the  edges.  For  a  vertex  v  €  V,  let  in(t>)  C  V  and  out(r)  C  1’,  respectively,  denote 
the  sets  of  edges  entering  and  leaving  v. 

Suppose  xT  =  (®x, . . . ,  xn )  are  indeterminate. 

•  g(xi , . . . ,  xn )  is  a  linear  function  if  there  exist  real  numbers  Co, . . . ,  Cn  such  that 
g(xu  . . . ,  xn)  =  Co  -r  £”=1  c,x.-. 

•  Sr=i  CjZj  =  Co  is  a  linear  equation. 

•  5T?=i  c'x'  —  co  and  J2?=i  <  Co  are  linear  inequalities. 

•  Both  linear  equations  and  linear  inequalities  are  referred  to  as  linear  constraints. 

•  A  linear  system  is  a  set  of  linear  constraints.  Set  of  m  inequalities  is  represented 
by  a  matrix  A  €  _RTnXn  and  a  vector  b  €  R"1,  as  Ax  <  b.  A  vector  is  feasible  if 
it  satisfies  all  the  constraints. 

•  A  linear  programming  problem  consists  of  a  linear  system  and  a  linear  objective 
function  given  by  a  vector  c  €  JT*.  The  goal  is  to  maximize  (or  minimize)  cTx 
subject  to  the  inequalities  Ax  <  b. 


Chapter  2 

Detecting  cycles  in  periodic 
graphs 


This  chapter  is  concerned  with  the  problem  of  recognizing,  in  a  graph  with  rational 
vector- weights  associated  with  the  edges,  the  existence  of  a  cycle  whose  total  weight  is 
the  zero  vector.  This  problem  is  known  to  be  equivalent  to  the  problem  of  recognizing 
the  existence  of  cycles  in  periodic  (dynamic)  graphs  and  to  the  validity  of  systems 
of  recursive  formulas.  It  was  previously  conjectured  that  combinatorial  algorithms 
exist  for  the  cases  of  two-  and  three-dimensional  vector-weights.  This  chapter  gives 
strongly  polynomial  algorithms  for  any  fixed  dimension.  Moreover,  these  algorithms 
also  establish  membership  in  the  class  NC.  On  the  other  hand,  it  is  shown  that  when 
the  dimension  of  the  weights  is  not  fixed,  the  problem  is  equivalent  to  the  general 
linear  programming  problem  under  strongly  polynomial  and  logspace  reductions.  The 
algorithms  discussed  here  are  based  on  reducing  the  problem  to  solving  instances  of 
the  parametric  minimum  cycle  problem.  When  the  dimension  of  the  vector-weights 
is  fixed,  the  problem  can  be  solved  within  the  same  time  bound  of  solving  an  instance 
of  the  parametric  minimum  cycle  problem  on  the  same  graph.  The  latter  problem  is 
defined  in  Chapter  3,  where  we  present  MC  and  strongly  polynomial  algorithm  for  it 
when  the  number  of  parameters  is  fixed.  Earlier  versions  of  the  results  presented  in 
this  chapter  appeared  in  [6,  9]. 
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2.1  Introduction 

This  chapter  is  concerned  with  the  following  problem: 

Problem  2.1.1  Given  is  a  digraph  G  =  (V,E,f)  where  /  :  E  —*  Rd  associates 
with  each  edge  of  G  a  d-vector  of  rational  numbers.  Determine  whether  G  contains 
a  zero-cycle,  i.e.,  a  cycle  whose  edge  vectors  sum  to  the  zero  vector. 

Problem  2.1.1  is  important  since  it  has  been  shown  by  Iwano  and  Steiglitz  [34]  that 
an  infinite  “periodic  graph”  has  a  cycle  if  and  only  if  the  “dependence  graph”  which 
generates  it  has  a  zero-cycle.  The  periodic  graph  is  well-defined  when  the  weights  are 
integral.  Given  a  graph  G  =  (V,E)  with  integral  vector- weights  c^-  €  Rd  (called  the 
dependence  graph),  the  corresponding  periodic  graph  is  generated  as  follows.  Place 
a  copy  of  the  vertex  set  of  G  at  each  point  of  the  integral  lattice  in  Rd.  For  every 
lattice  point  z  and  for  every  edge  (i,j)  €  E,  connect  the  copy  of  vertex  i  that  is 
located  at  z  with  the  copy  of  vertex  j  that  is  located  at  z  +  c,j.  The  problem  on 
this  infinite  periodic  graph  is  to  identify  a  cycle  or  conclude  that  none  exists.  See 
figures  2.1  and  2.2  for  examples  of  dependence  graphs  with  corresponding  periodic 
graphs. 

The  problem  was  first  introduced  in  a  paper  by  Karp,  Miller  and  Winograd  [39], 
where  it  was  raised  in  the  context  of  recursive  definitions.  They  gave  an  algorithm 
which  amounts  to  solving  polynomially  many  linear  programs.  They  stated  the  prob¬ 
lem  of  validity  of  recursive  definitions  as  detecting  a  cycle  in  a  periodic  graph.  Con¬ 
sider  n  functions  Fi,...,Fn  on  the  d-dimensional  integral  lattice  defined  by 

=  MFiiz  ~  Cii).  •  •  • » F*{*  ~  Cin))  , 

(the  Cij’ s  are  integral  vectors)  where  the  ipi’s  are  specified  (assume  for  simplicity  the 
boundary  conditions  F(z)  =  0  for  z  £  R+)-  The  corresponding  dependence  graph  has 
n  vertices  t>j, . . . ,  vn,  and  edges  (vi,Vj)  of  weight  c,-;  if  F,(z)  depends  on  F,(z  —  dj)- 
In  order  for  the  functions  to  be  well  defined  it  is  necessary  and  sufficient  that  there 
will  be  no  cycle  whose  total  vector  weight  is  nonnegative.  The  problem  of  detecting  a 
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Figure  2.1:  A  one-dimensional  dependence  graph  G  and  the  periodic  graph  G ’ 

nonnegative  cycle  can  be  reduced  to  the  problem  of  detecting  a  zero  cycle  by  adding 
at  each  vertex  i,  d  loops  with  weights  (  —  1, 0, . . . ,  0),  (0,  —1, . . . ,  0),  ...  ,  (0, . . . ,  0,  — 1). 
As  an  example,  consider  the  recurrence  relation  F(z)  =  F(z  —  1)  +  F(z  —  2).  The 
corresponding  dependence  graph  consists  of  a  single  vertex  and  two  loops  of  weights 
—  1  and  —2. 

The  problem  was  re-introduced  by  Iwano  and  Steiglitz  [34]  following  Orlin  [51]. 
Orlin  studied  properties  of  one- dimensional  periodic  graphs  which  included  computing 
strongly  connected  components.  Kosaraju  and  Sullivan  [41]  presented  a  polynomial¬ 
time  algorithm.  The  time  complexity  of  the  latter  is  0(Zn log  u),  where  Z  is  the 
complexity  of  a  certain  linear  programming  problem  with  2m  variables  and  m  •+  n  4-  fc 
constraints  (n  =  | V|,  m  =  |2£]).  They  also  presented  O(Z)  algorithms  for  the  cases 
d  =  2,3. 

It  was  conjectured  in  [41]  that  combinatorial  algorithms  exist  for  the  cases  d  =  2, 3. 
Although  the  notion  of  a  combinatorial  algorithm  is  not  well-defined,  we  believe 
we  have  confirmed  this  conjecture  and  have  in  fact  proven  much  more  than  what 
was  expected.  We  show  that  not  only  for  d  =  2,3,  but  also  for  any  fixed  d,  the 
problem  can  be  solved  in  strongly  polynomial  time,  and  indeed  by  “combinatorial” 
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Figure  2.2:  A  1  wo- dimensional  dependence  graph  G  and  the  periodic  graph  G~ 

algorithms.  Furthermore,  our  algorithms  can  be  implemented  on  a  parallel  machine 
sc  that  membership  in  the  class  NC  is  established  for  any  fixed  d.  To  complement 
these  results,  we  also  show  that  the  general  problem  (where  d  is  considered  part  of  the 
input)  is  as  hard  as  the  general  linear  programming  problem  in  a  sense  as  follows.  We 
show  that  any  linear  programming  problem  can  be  reduced  in  strongly  polynomial 
time  and  logspace  to  our  problem  with  general  d. 

Consider  an  instance  of  Problem  2.1.1  where  m  =  \E\  is  the  number  of  edges  and 
n  =  |t/|  is  the  number  of  vertices.  We  show  that  when  d  is  fixed  the  problem  can  be 
solved  within  the  following  bounds: 

i.  0(log2in  +  logrfm)  parallel  time  on  0(n3  +  m)  processors. 

ii.  0(m(logMn  -f  log^m))  sequential  time,  when  m  =  fl(n3logn). 

iii.  0((n3  -f  m)logMn)  sequential  time,  when  m  =  0(n3  log  n)  and  m  =  Q(nJ). 
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iv.  0(n3log2^d  2^ti  —  nmlog2^d  ^  n)  sequential  time,  when  m  =  0(n2). 

The  constant  factors  hidden  in  the  above  bounds  are  of  the  order  of  0(3^),  and  arise 
from  the  multi- dimensional  search  algorithm  [5,  19,  47].  The  dominating  factor  in 
the  time  complexity  arises  from  solving  0(d)  instances  of  the  parametric  minimum 
cycle  problem  with  d  —  1  parameters,  m  edges  and  n  nodes  (see  Chapter  3). 

It  is  worth  mentioning  that  other  properties  of  periodic  graphs  were  considered  in 
the  literature.  The  computation  of  strongly  connected  components  [51,  9]  (see  Sec¬ 
tion  2.6.2),  scheduling  the  computation  of  systems  of  recursive  definitions  [9,  39,  56] 
(see  Section  2.6.1),  planarity  testing  [33],  computing' connected  components,  recog¬ 
nizing  bipartiteness  [12,  35,  51],  and  computing  a  minimum  average  cost  spanning 
tree  [12,  51]. 

In  Section  2.2  we  give  some  necessary  definitions.  In  Section  2.3  we  give  an 
overview  of  the  basic  ideas  underlying  our  algorithms.  In  Section  2.4  we  describe  a 
strongly  polynomial  algorithm  for  detecting  zero-cycles.  This  algorithm  is  stated  in 
terms  of  solving  instances  of  a  parametric  version  of  the  minimum  cycle  problem.  The 
latter  problem  and  a  strongly  polynomial  algorithm  for  it  are  introduced  in  Chapter  3. 
Section  2.5  contains  some  necessary  geometric  lemmas.  Section  2.6  discusses  two  other 
problems  on  periodic  graphs  for  which  the  cycle  detection  algorithm  is  applicable.  One 
problem  is  computing  the  strongly  connected  components,  the  other  is  scheduling  the 
computation  of  systems  of  uniform  recurrences  which  are  modeled  by  periodic  graphs. 
Concluding  remarks  are  given  in  Section  2.7. 


2.2  Preliminaries 

Definition  2.2.1  Given  a  graph  G  =  ( V ,  E ),  a  circulation  x  —  (xij)  (( i,j )  EE)  is  a 
solution  of  the  system: 

(*»i  ~  zi«)  =  0  (t  =  1, . . . ,  n) 


x  >  0  . 


2.3.  AN  OVERVIEW 
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Let  E(x)  denote  the  set  of  active  edges,  i.e.,  edges  (i,j)  with  Xij  >  0.  Vertices  incident 
on  active  edges  are  said  to  be  active  in  x.  If  the  active  edges  form  a  connected 
subgraph  of  G,  then  we  say  that  the  circulation  x  is  connected.  If  these  edges  form  a 
simple  cycle,  then  we  say  that  a:  is  a  simple  cycle,  and  with  no  ambiguity  we  continue 
to  talk  about  the  set  of  active  vertices  as  a  simple  cycle. 

Remark  2.2.2  Every  circulation  a:  is  a  sum  of  connected  circulations,  corresponding 
to  the  decomposition  of  E(x)  into  strongly  connected  components.  Moreover,  it  is 
also  well  known  (and  easy  to  see)  that  every  circulation  can  be  represented  as  a  sum 
of  simple  cycles.  If  a  connected  circulation  x  =  (xij)  consists  of  rational  numbers, 
then  it  is  proportional  to  an  integral  circulation.  A  connected  integral  circulation 
can  be  represented  by  a  cycle  (u0,Ui, . . .  ,uq  =  u0)  ((u;_i,iii)  €  E),  not  necessarily 
simple,  where  is  interpreted  as  the  number  of  times  the  edge  (i,j)  is  traversed 
throughout  the  cycle.  It  is  easy  to  construct  irrational  circulations  that  cannot  be 
interpreted  this  way. 

Definition  2.2.3  Given  vector  weights  dj  =  (cL, . . . ,  c*)T  ((i,j)  €  E)  (i.e.,  using 
the  notation  of  Problem  2.1.1,  dj  =  /(e)  where  e  =  (x,  j)),  a  circulation  x  =  (x;j)  is 
called  a  zero- circulation  if  it  satisfies  the  vector  equation  dj*ij  —  0-  An  integral 
connected  nontrivial  zero-circulation  is  called  a  zero-cycle. 


2.3  An  overview 

We  first  present  an  informal  overview  of  the  basic  ideas  involved  in  the  zero-cycle 
detection  algorithm. 

Suppose  G  =  (V,  E ,  /)  contains  a  vector  zero-cycle  C,  i.e.,  the  sum  of  the  vector 
weights  dj  around  C  is  equal  to  the  zero  vector.  Obviously,  for  any  A  €  R?,  the  sum 
of  the  scalar  weights  A Tdj  around  C  is  zero.  It  follows  that  for  every  A  €  R?,  the 
weight  of  the  minimum  cycle  relative  to  the  scalars  A Tdj  is  nonpositive.  In  other 
words,  if  there  exists  a  A  €  R?  such  that  all  the  cycles  are  positive  relative  to  A Tdj, 
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then  this  A  certifies  that  there  are  no  zero-cycles.  On  the  other  hand,  it  can  be 
shown  that  if  for  every  A  ^  0  there  exists  a  negative  cycle,  then  there  exists  a  vector 
zero- cycle. 

The  observation  of  the  preceding  paragraph  suggests  that  one  might  first  attempt 
to  find  a  A  for  which  all  the  cycles  are  positive  relative  to  the  weights  A TC{j.  In  other 
words,  we  wish  to  maximize  over  A  the  weight  of  the  minimum  cycle  relative  to  the 
scalar  weights  A T Cij. 

This  task  can  be  viewed  as  a  parameterized  extension  of  the  well-known  problem 
of  detecting  the  existence  of  a  negative  cycle  or  finding  a  minimum  weight  cycle  in  a 
graph  with  scalar  weights.  The  latter  scalar  weights  problem  cam  be  viewed  as  asking 
for  an  evaluation  of  a  function  at  a  given  A,  which  can  be  solved  by  running  an  all 
pairs  shortest  paths  algorithm.  The  search  for  A  els  above  can  be  formulated  as  an 
optimization  problem  over  the  A-space,  where  one  seeks  to  maximize  the  function  of 
the  minimum  weight  of  any  cycle  relative  to  the  ATCj,’s.  However,  there  is  a  certain 
difficulty  with  this  approach  since  the  minimum  is  not  well-defined  when  there  are 
negative  cycles.  Note  that  we  do  not  require  the  cycle  to  be  simple,  since  the  problem 
of  finding  a  minimum  simple  cycle  is  AA'P-hard.  However,  we  can  instead  consider 
one  of  the  following  quantities:  (i)  the  minimum  cycle-mean,  i.e.,  the  minimum  of 
the  average  weight  per  edge  of  the  cycle,  or  (ii)  the  minimum  of  the  total  weight  of 
cycles  (not  necessarily  simple)  consisting  of  at  most  n  edges.  It  is  easy  to  see  that 
the  sign  of  the  minimum  cycle-mean  (which  is  the  same  as  the  sign  of  the  quantity 
defined  in  (ii))  distinguishes  the  following  three  cases:  I.  there  exists  a  negative  cycle, 
II.  there  exists  a  zero  cycle  but  no  negative  cycles,  and  III.  all  the  cycles  are  positive. 

If  an  algorithm  for  either  (i)  or  (ii)  of  the  preceding  paragraph  is  given,  which 
uses  only  additions,  comparisons  and  multiplications  by  constants,  then  such  an  al¬ 
gorithm  can  be  “lifted”  to  solve  the  optimization  problem.  Very  roughly,  the  basic 
idea  (which  is  explained  in  [44,  45,  46])  is  to  run  the  given  algorithm  simultaneously 
on  a  continuum  of  values  of  A,  while  repeatedly  restricting  the  set  of  these  values, 
until  the  optimum  is  found.  Another  interpretation  of  the  lifted  algorithm  is  that  it 
operates  on  linear  forms  rather  than  constants.  When  the  lifted  algorithm  needs  to 
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compare  two  linear  forms,  it  first  computes  a  hvperplane  which  cuts  the  space  into 
two  halfspaces,  such  that  the  outcome  of  the  comparison  is  uniform  throughout  each 
of  them.  The  algorithm  then  consults  an  “oracle”  (whose  details  are  given  later) 
for  selecting  the  correct  halfspace,  and  moves  on.  The  lifted  algorithm  maintains  a 
polyhedron  P  which  is  the  intersection  of  the  correct  halfspaces. 

As  noted  above,  if  a  vector  A  is  found  such  that  all  the  cycles  are  positive  then  we 
are  done.  Otherwise,  the  lifted  algorithm  concludes  that  A  =  0  is  an  optimal  solution, 
i.e.,  for  every  A  there  exists  a  nonpositive  cycle,  so  the  choice  of  A  =  0  maximizes  the 
weight  of  a  minimum  cycle.  However,  the  zero  vector  itself  does  not  convey  enough 
information.  Nonetheless,  the  algorithm  actually  computes  a  vector  A  ^  0  (called 
a  separating  vector)  in  the  relative  interior  of  the  set  of  optima1 ,  along  with  a  “cer¬ 
tificate”  of  optimality.  The  certificate  consists  of  vector  circulation  values  Ci, . . .  ,Cr- 
These  values  span  in  nonnegative  linear  combinations  a  suitable  linear  space,  proving 
that  there  is  no  direction  to  move  so  that  the  minimum  cycle  becomes  positive.  This 
“certificate”  is  used  to  actually  find  a  zero-cycle  when  the  algorithm  decides  that  one 
exists.  The  scalar  weights  A  Cij  then  induce  a  decomposition  of  the  graph,  where  two 
vertices  are  in  the  same  component  if  they  belong  to  the  same  scalar  zero-cycle.  It 
is  then  shown  that  a  vector  zero- cycle  exists  in  the  given  graph  if  and  only  if  such  a 
cycle  exists  in  one  of  the  components.  Also,  if  there  is  only  one  component  (and  the 
graph  has  more  than  one  vertex)  then  there  exists  a  zero-cycle.  These  observations 
suggest  an  algorithm  which  iteratively  computes  a  separating  vector,  decomposes  the 
graph  accordingly,  and  works  on  the  components  independently.  The  depth  of  the 
decomposition  tree  is  bounded  by  the  dimension  of  the  weights. 

The  part  we  have  so  far  left  open  is  the  “oracle”  which  recognizes  the  correct 
halfspace.  It  turns  out  that,  as  in  [47],  the  oracle  can  be  implemented  by  recursive 
calls  to  the  same  algorithm  in  a  lower  dimension.  This  will  be  explained  later  in  the 
chapter. 

We  have  outlined  the  general  framework  for  establishing  the  qualitative  result  of 

lThere  is  also  the  possibility  that  the  sero  vector  is  the  only  optimal  solution,  so  there  is  no 
separating  vector.  However,  in  this  case,  assuming  strong  connectivity  of  the  graph,  it  can  be  shown 
that  a  sero-cyde  exists. 
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strongly  polynomial  time  bounds  for  any  fixed  dimension.  However,  to  get  more  effi¬ 
cient  algorithms  and  to  establish  membership  in  Al'C ,  we  perform  multi-dimensional 
searches  as  in  [5,  19,  47].  By  doing  so  we  reduce  the  number  of  calls  to  an  “oracle” 
algorithm  which  actually  need  to  be  performed,  to  a  polvlog  in  the  number  of  deci¬ 
sions.  The  design  can  be  viewed  as  an  integration  of  the  techniques  of  [45]  and  [47] 
(and  the  further  improvements  of  [5,  19]). 


2.4  Detecting  zero-cycles 

In  this  section  we  develop  an  algorithm  which  decides  the  existence  of  a  zero-cycle  in 
the  vector- weighted  graph  G  =  (V,  E,  f),  f  :  E  — ►  Zd.  If  a  zero-cycle  exists  in  G,  we 
find  an  explicit  one.  The  algorithm  introduced  in  this  section  uses  as  a  subroutine 
the  parametric  minimum  cycle  algorithm  of  Chapter  3. 

Proposition  2.4.1  A  graph  G  —  ( V,E,f )  with  vector  weights  (see  Problem  2.1.1) 
has  a  zero-cycle  (see  Definition  2.2.  S)  if  and  only  if  it  has  a  connected  zero- circulation. 

Proof:  Note  that  if  there  exists  a  connected  zero- circulation  then  there  exists  a 
rational  connected  one.  Hence,  there  exists  an  integral  connected  zero-circulation 
which  is  equivalent  to  a  zero-cycle  (see  Remark  2.2.2).  ■ 

Definition  2.4.2  Given  a  vector-weighted  graph  G  =  (V,  E,  f),  we  use  the  following 
definitions  and  notation: 

i.  Let  AC  denote  the  cone  of  vectors  A  =  (Ai, . . . ,  A*)r  for  which  the  scalar- weighted 
graph  (V,  E,  fT A)  has  no  negative  cycles. 

ii.  A  nonzero  vector  A  €  relint  AC  (the  relative  interior  of  AC)  is  called  a  separating 
vector  for  G. 

iii.  A  separating  vector  A  for  which  the  scalar-weighted  graph  (V,  E,  fT  A)  has  only 
positive  cycles  is  called  a  witness  for  G. 
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A  witness  proves  the  nonexistence  of  nontrivial  zero- circulations.  Although  for  this 
purpose  the  vector  does  not  have  to  be  in  relint  AC,  we  add  this  as  a  requirement 
which  is  helpful  in  the  recursion. 

Remark  2.4.3  The  cone  AC  can  be  described  as  the  projection  on  the  A-space  ( Rd )  of 
a  cone  in  RTl+d  (the  space  of  (7 rj, . . . ,  7rn.  A))  which  is  characterized  by  the  inequalities: 

iti  -  Vj  +  XTCij  >  0  ((*,i)  6  E)  . 

Note  that  the  system  of  inequalities  above  is  the  linear  programming  dual  of  the 
zero- circulation  problem. 

Definition  2.4.4 

i.  Given  G  =  ( V,E,f ),  denote  by  CIRC(G!)  the  set  of  all  circulation  values  c  = 
Ylij  CijXij  (where  x  =  ( x# )  is  a  circulation  in  G). 

ii.  Given  a  separating  vector  A  ^  0  (i.e.,  A  €  relint  AC),  denote  by  ORTH(G,  A) 
the  set  of  vectors  c  G  CIRC(G)  which  are  orthogonal  to  A. 

Note  that  CIRC(G')  is  a  convex  polyhedral  cone. 

Proposition  2.4.5  The  set  AC  is  precisely  the  set  of  vectors  A  such  that  A Tc  >  0  for 
allc€  CIRC(G). 

Proof:  For  any  circulation  x  and  any  set  of  scalars  7r», 

£(*<  -  1Tj)Xij  =  0  . 

ij 

If  the  (vector)  value  of  x  is  c,  then 

A Tc  =  £(A Tcn)xij  , 

By  Remark  2.4.3,  if  A  G  AC  then  A Tc  >  0.  Conversely,  if  Arc  >  0  for  all  c  G 
CIRC(G!),  then  obviously  there  are  no  negative  cycles  in  (V,  E,  fT A),  so  A  G  AC.  1 
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Theorem  2.4.6 

i.  ORTH(G,  A)  is  independent  of  X,  and  hence  will  be  denoted  by  ORTH(G).  In 
fact ,  ORTH(G)  is  the  lineality  space  o/CIRC(G).  (In  case  1C  =  {0},  define 
ORTH(G)  to  be  the  entire  Rd.) 

ii.  ORTH(G)  =  (lin/C)-1-,  that  is,  the  orthogonal  complement  of  the  linear  subspace 
spanned  by  K  (hence  it  is  a  linear  subspace). 

Proof:  The  proof  is  based  on  a  geometric  analysis  which  is  given  in  Section  2.5.  • 

The  zero-cycle  detection  algorithm  partitions  the  graph  recursively  into  node  dis¬ 
joint  subgraphs.  The  tree  structure  defined  by  this  partitioning  process,  with  sub¬ 
graphs  as  nodes,  is  referred  to  as  the  decomposition  tree  of  the  graph  G.  In  this 
partition,  the  subgraphs  are  the  connected  components  of  a  “maximal”  (in  the  sense 
of  the  number  of  active  edges)  zero-circulation.  This  definition  implies  that  a  zero- 
cycle  exists  in  G  if  and  only  if  a  zero-cycle  exists  at  least  in  one  of  the  subgraphs  which 
G  is  partitioned  into.  If  a  subgraph  is  not  partitioned  any  further,  it  is  a  “leaf”  of 
the  decomposition  tree,  and  for  this  subgraph  the  algorithm  determines  the  existence 
of  a  zero-cycle  directly.  In  [39]  and  [41]  this  partition  is  computed  by  solving  a  set 
of  linear  programming  problems  in  order  to  decide  for  each  edge  whether  or  not  it 
is  active  in  any  zero-circulation  in  G.  The  subgraphs  are  the  connected  components 
induced  by  the  active  edges.  In  this  chapter,  the  computation  of  the  partition  is  done 
differently  by  an  algorithm  that  gives  strongly  polynomial  time  bounds. 

For  a  given  graph  G  =  (V,  E,  f)  the  algorithm  first  tries  to  find  a  witness  (if  a 
witness  is  found  a  zero-cycle  does  not  exist  and  we  stop).  In  case  a  witness  does  not 
exist,  a  separating  vector  is  computed.  The  computation  of  a  witness  or  a  separating 
vector  is  done  by  using  the  parametric  minimum  cycle  algorithm  developed  in  Chap¬ 
ter  3.  The  algorithm  then  proceeds  to  compute  a  partition  of  G  using  the  separating 
vector  found  in  the  previous  step.  If  the  partition  has  only  Cne  subgraph,  it  is  shown 
that  a  zero-cycle  exists  in  G;  otherwise,  the  algorithm  proceeds  recursively  on  the 
subgraphs.  Note  that  a  witness  or  a  separating  vector  can  be  computed  by  solving 
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linear  programming  problems.  The  difficulty  is  to  find  a  strongly  polynomial  time 
solution. 

In  the  rest  of  this  section  we  first  discuss  the  two  subroutines  used  by  the  algorithm 
and  then  proceed  to  the  algorithm  itself  (Subsection  2.4.3).  The  first  subroutine  is 
the  computation  of  a  witness  or  a  separating  vector  (Subsection  2.4.1).  The  second 
(Subsection  2.4.2)  is  the  partitioning  of  the  graph  when  a  separating  vector  is  given. 

2.4.1  Computing  a  witness  or  a  separating  vector 

Problem  2.4.7  Given  is  a  graph  G  —  ( V,E,f ).  Find  a  witness  for  G  (see  Def¬ 
inition  2.4.2)  if  one  exists;  otherwise,  find  a  separating  vector  A  or  conclude  that 
no  such  vector  exists,2  and  provide  a  collection  C  of  circulations  with  vector-values 
c1,...,cr  along  with  a  set  of  positive  numbers  ai,...,ov  such  that  r  =  0(d), 
cone{ca, . . .  ,cr}  D  ORTH(G),  and  23i=i  a«c‘  =  ®- 

Remark  2.4.8  The  collection  C  is  used  to  compute  an  explicit  zero-cycle  if  one 
exists.  It  enables  us  to  construct  a  circulation  of  any  given  value  c'  €  ORTH(G). 
The  decision  problem  (existence  of  a  zero-cycle)  can  be  solved  even  if  C  is  not  given. 

Proposition  2.4.9  Problem  2.4-7  can  be  solved  using4hree  applications  of  the  para¬ 
metric  minimum  cycle  algorithm  on  G  with  d  —  1  parameters. 

Proof:  Deferred  to  Chapter  3.  ■ 

The  following  proposition  is  used  for  the  proof  of  Proposition  2.4.9. 

Proposition  2.4.10  Given  vectors  cl,...,cr  C  R 4,  and  a  subspace  S  C  R?,  the 
following  two  conditions  are  equivalent: 

i.  For  every  A  &  S, 

min{Arc1, . . . ,  Arcr}  <  0  . 


2Note  that  K  ^  9  since  0  6  K\  a  separating  vector  exists  if  and  only  if  K  ^  {0}. 
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it.  cone-fc1, . . .  ,cr}  3  S~  . 

Proof:  The  equivalence  follows  from  Farkas’  Lemma  (see  Proposition  2  4. 12).  First 
we  assume  (i)  and  show  that  (ii)  is  implied.  Consider  z  6  S1 .  If  a  vector  y  €  Rd  is 
such  that  yrz  <  0,  then  obviously  y  &  S.  The  latter,  combined  with  (i)  gives  the 
left  hand  side  condition  on  Farkas’  Lemma.  Therefore,  from  the  right  hand  side  we 
have  z  €  cone{ci, . . .  ,  c,}. 

We  show  that  (ii)  implies  (i).  Assume  that  z  £  Sx  =>  z  €  cone{ci, . . .  ,cr}.  It 
follows  from  Farkas’  Lemma  that  for  all  z  6  Sx,  we  have  (Vy  €  Rd)yTz  <  0  => 
min{yTCj}  <  0.  Consider  a  vector  A  £  S.  There  must  exist  z  €  S1  such  that 
zT A  <  0  (otherwise,  Vz  €  S±,zTy  =  0  in  contradiction  to  A  £  5).  We  have 
zTA  <  0.  Therefore  it  follows  from  the  left  hand  side  of  Farkas’  Lemma  that 
min{ArCj}  <  0.  i 

Corollary  2.4.11  Let  the  vectors  c1 , . . . ,  cr  be  circulation  values.  If  for  every  vector 
A  lin  K,  min{ATc1, . . . ,  Arcr}  <  0,  then  cone{c1, . . .  ,cr}  3  ORTH(G). 

Proof:  Take  S  =  lin/C,  and  recall  from  Theorem  2.4.6  part  (ii)  that  ORTH(Gr)  = 
(lin/C)-1-.  | 

Proposition  2.4.12  [ Farkas ’  Lemma  [22]]  For  any  vectors  z,Ci  €  R? ,  i  =  1, . . .  ,r, 
(Vy  €  K*){yTz  <  0  =*►  min{yrCj}  <  0)  z  €  cone(ci)  . 

2.4.2  Computing  the  partition 

After  computing  a  separating  vector,  the  zero-cycle  detection  algorithm  proceeds  to 
compute  a  partition  of  the  graph.  In  this  subsection  we  define  this  partition,  and 
discuss  some  of  its  properties.  We  also  present  the  algorithm  that  computes  the 
partition  when  the  separating  vector  is  given. 

The  essence  of  the  following  proposition  is  mentioned  in  [41]. 
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Proposition  2.4.13  Let  G  =  (V,E,w)  be  a  scalar-weighted  graph  with  no  negative 
cycles.  Using  one  application  of  an  all-pairs  shortest  path  algorithm  we  can  find 
vertex  disjoint  subgraphs  G\,...,Gq  of  G  with  the  following  properties.  Edges  or 
vertices  that  are  not  active  in  any  zero-cycle  of  G  are  not  contained  in  any  of  the 
G{ ’s.  Two  vertices  u  and  v  are  in  the  same  G{  if  and  only  if  there  exists  a  (scalar) 
zero-cycle  of  G  in  which  both  u  and  v  are  active. 

Proof:  Apply  an  all-pairs  shortest  path  algorithm  to  compute  the  distance  duv 
between  all  pairs  of  vertices  u,  v  6  V.  Two  vertices  u,v  are  in  the  same  subgraph 
G{  if  and  only  if  duv  +  <£„„  =  0.  If  dvv  >  0,  then  v  is  not  a  part  of  a  zero-cycle  and 
does  not  belong  to  any  G 

In  order  to  identify  all  the  edges  that  participate  in  some  zero-cycle,  do  the  following. 
Select  arbitrarily  some  vertex  w  and  use  a  single-source  shortest  path  algorithm  to 
compute  the  distances  irv  (v  €  V)  from  the  vertex  w  to  all  other  vertices.  For  every 
edge  (u,  v )  define, 

Suv  —  7TU  7TV  •+  duv  ^  0  . 

Determine  that  {u,v)  is  an  active  edge  if  and  only  if  6UV  =  0.  | 

Remark  2.4.14  Each  component  of  the  partition  of  Proposition  2.4.13  contains  a 
zero-cycle  where  all  the  vertices  of  the  component  are  active.  This  zero-cycle  can  be 
constructed  easily  from  the  shortest  paths. 

Proposition  2.4.15  Suppose  A  is  a  separating  vector  of  G  =  (V,  E,  /).  Consider 
the  scalar  weights  w  =  fT A  on  the  edges  of  G.  Observe  that  by  the  definition  of 
a  separating  vector,  there  are  no  negative  cycles  in  the  scalar  weighted  ( V,E,w ). 
Let  Gi, . . . ,  Gq  be  the  partition  of  G  into  subgraphs  as  defined  in  Proposition  £.4. IS, 
relative  to  the  scalar  weights  w.  Under  these  conditions,  a  (vector)  zero-cycle  exists 
in  G  if  and  only  if  a  (vector)  zero-cycle  exists  in  one  of  the  Gi ’s. 

Proof:  The  ‘if’  part  is  trivial.  For  the  ‘only  if’  part,  suppose  x  is  a  (vector) 
zero-cycle  of  G  =  (V,  E,  /).  Then  x  is  a  scalar  zero-cycle  of  (V,E,fT A).  By 
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the  definition  in  Proposition  2.4.13,  all  the  vertices  active  in  x  are  in  the  same 
component  G{  and  hence  a:  is  a  vector  zero-cycle  of  G{.  \ 

Proposition  2.4.16  If  CIRC(G)  contains  a  nontrivial  linear  subspace  then  a  non¬ 
trivial  (vector)  zero- circulation  exists  in  G. 

Proof:  The  proof  is  immediate.  | 

Proposition  2.4.17  If  C  =  {ci,...,cr}  C  Rl  and  cti  >  0  (i  =  1, . . .  ,r)  are  such 
that  aici  =  0,  then  for  any  v  €  Rl ,  it  takes  0(£2r)  time  either  to  find  nonneg¬ 
ative  rational  constants  pi , ,(3r  such  that  v  =  £/3,-Ci,  0T  recognize  that  no  such 
constants  exist. 

Proof:  Express  v  as  a  linear  combination  of  the  vectors  in  C  by  solving  the  linear 
system  of  equations  v  =  £7 jCj.  This  system  has  l  equations  and  r  variables,  and 
thus  can  be  solved  by  Gaussian  eliminations  using  0(£2r)  operations.  If  7i  are 
nonnegative  take  /3t  =  7^;  otherwise,  denote  a  =  mini<j<T  a*,  7  =  min1<j<r7i  and 
let  =  7;  —  (7 /a)ctj.  It  is  easy  to  verify  that  Pi  (1  <  i  <  r)  are  nonnegative  and 
£'=1  PiCi  =  0.  | 

Proposition  2.4.18  Let  A  be  a  separating  vector  of  G  —  (V,  E,  f).  Let  G\, . . . ,  Gq 
be  the  partition  of  G  into  subgraphs  (as  defined  in  Proposition  2.4-1S),  relative  to  the 
scalar  weights  fT  A.  If  the  partition  constitutes  a  single  subgraph  (i.e,  q  -  \),  then  G 
has  a  (vector)  zero-cycle. 

Proof:  If  G  has  a  single  component  relative  to  fT A,  then  all  active  vertices  and 
edges  are  contained  in  G\.  Observe  that  all  cycles  with  vector  value  in  ORTH ((7) 
are  scalar  zero-cycles  relative  to  fT  A.  There  exists  a  scalar  zero-cycle  in  (V,  E,  fT  A) 
in  which  all  the  vertices  of  G\  are  active.  Thus,  there  exists  a  value  c  €  ORTH(Gf) 
which  is  attained  at  a  circulation  where  all  the  vertices  in  G\  are  active,  so  this 
circulation  is  connected.  By  Theorem  2.4.6  and  Proposition  2.4.16,  there  exists  a 
circulation,  not  necessarily  connected,  whose  value  is  — c.  The  active  vertices  in 
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G  = 

Separating  Vector:  A  =  (1,1) 

_ y 

G\  =  ♦  G%  = 

(-2,2) 

Witness  Vector:  A  =  (—1, 1)  Witness  Vector:  A  =  (1,  —1) 

Figure  2.3:  Example  of  the  decomposition  of  a  graph  G 

this  circulation  must  be  contained  in  G\.  By  combining  the  connected  circulation 
supporting  c  with  the  one  supporting  — c,  we  obtain  a  connected  (nontrivial)  zero- 
circulation,  that  is,  a  zero-cycle  of  G.  i 

Remark  2.4.10  Suppose  we  have  a  set  C  of  (vector)  cycle  values  such  that  coneC  D 
ORTH(G!).  The  vector  zero-cycle  of  Proposition  2.4.18  can  be  explicitly  constructed 
as  follows.  We  compute  a  connected  zero-circulation  relative  to  the  scalar  weights 
fT A,  in  which  all  the  vertices  are  active.  The  vector  value  of  this  circulation  is 
c  €  ORTH(G)  (see  Remark  2.4.14).  It  follows  from  Proposition  2.4.17  that  we  can 
construct  a  circulation  with  value  — c.  The  combination  of  the  two  circulations  is  a 
connected  zero-circulation. 

Remark  2.4.20  Assume  the  graph  G  does  not  have  a  separating  vector  (that  is, 
ORTH(G)  =  R?).  If  we  are  given  a  set  C  of  cyde  values  whose  conic  hull  equals 
/F*,  then  a  zero-arculation  can  be  constructed  as  follows.  Find  a  cyde  in  which  all 
the  vertices  are  active  (G  is  strongly  connected).  Denote  the  value  of  this  cycle  by 


(1,2)  (-1,2) 
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c.  It  follows  from  Remark  2.4.14  that  we  can  find  a  circulation  with  value  —  c.  The 
combination  of  the  two  circulations  is  a  (nontrivial)  connected  zero-circulation. 

Remark  2.4.21  Remarks  2.4.19  and  2.4.20  discuss  the  construction  of  an  explicit 
zero-cycle.  Observe  that  if  C  is  of  size  0(d),  then  the  time  complexity  of  constructing 
a  zero-cycle  is  0(dz )  (see  Proposition  2.4.17). 

Proposition  2.4.22  A  witness  for  G  exists  if  and  only  if  G  does  not  have  a  non¬ 
trivial  zero-circulation. 

Proof:  The  proof  is  immediate,  i 


2.4.3  The  algorithm 

Algorithm  2.4.23  [zero-cycle  detection] 

i.  Run  an  algorithm  for  Problem  2.4.7  on  G  (see  Proposition  2.4.9).  If  a  witness 
for  G  is  found  then  stop.  Otherwise,  find  a  collection  C  of  circulation  values  such 
that  coneC  D  ORTH(G),  and  either  find  a  separating  vector  A  or  conclude  that 
none  exists.  In  the  latter  case,  conclude  that  a  connected  zero- circulation,  and 
hence  a  zero-cycle,  exist  in  G  (see  Remark  2.4.20  for  an  explicit  construction  of 
the  zero-cycle).  Otherwise, 

ii.  Construct  the  partition  of  G  which  is  defined  in  Propositions  2.4.13  and  2.4.15. 
If  the  partition  is  empty  then  G  does  not  have  a  zero-cycle.  Otherwise, 

iii.  If  there  is  only  one  component  (i.e.,  q  =  1),  then  by  Proposition  2.4.18,  G  = 

has  a  zero-cycle  (see  Remark  2.4.19  for  how  to  find  the  zero-cycle 
explicitly). 

iv.  Run  the  zero-cycle  detection  algorithm  on  Gi, . . .  ,G,  (recursively).  By  Propo¬ 
sition  2.4.15,  G  has  a  zero-cycle  if  and  only  if  at  least  one  of  Gi, . . . ,  G,  has 


one. 
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In  the  rest  of  the  present  section  we  prove  the  correctness  and  analyze  the  com¬ 
plexity  of  Algorithm  2.4.23. 

Proposition  2.4.24  If  G  is  partitioned  into  G\, . . . ,  Gq  (see  Proposition  2-4-15)  and 
for  some  Gi,  dim(ORTH(Gi))  =  dim(ORTH(Gf));  then  Gi  will  not  be  partitioned  any 
further  by  the  algorithm. 

Proof:  Since  ORTH(Gi)  C  ORTH(G!),  equality  of  dimension  implies  equality  of 
the  sets,  so  a  separating  vector  for  G  is  a  separating  vector  for  G ;.  i 

Corollary  2.4.25  Algorithm  2. 4-23  terminates  after  at  most  d  —  1  phases  of  parti¬ 
tioning. 

Proposition  2.4.26  The  time  complexity  of  the  zero-cycle  detection  algorithm  for  a 
graph  G  =  (V,  E,  f )  (where  f  is  d-dimensional)  is  dominated  by  the  complexity  of  Zd 
applications  of  solving  Problem  2.4-7  onG. 

Proof:  First,  observe  that  the  complexity  of  explicitly  constructing  a  zero-cycle 
(see  Remark  2.4.21)  is  dominated  by  the  complexity  of  the  rest  of  the  algorithm. 
Consider  the  recursion  tree  of  Algorithm  2.4.23.  The  recursion  tree  corresponds  to 
the  decomposition  tree  of  the  graph  G.  By  Corollary  2.4.25  this  tree  has  d  levels. 
Each  level  is  a  phase  of  partitioning  a  collection  of  subgraphs  G\, . . .  ,Gq,  with 
total  number  of  n  =  |V|  vertices.  The  total  computation  done  at  such  a  phase  is 
solving  Problem  2.4.7  for  each  subgraph  Gi,  and  then,  if  needed,  partitioning  it  as 
described  in  Proposition  2.4.13.  Observe  that  the  time  and  processor  complexities 
of  solving  Problem  2.4.7  and  partitioning  all  the  subgraphs  at  a  certain  phase, 
are  dominated  by  the  complexities  of  the  same  computation  done  on  the  graph  G. 
Recall  (see  Proposition  2.4.13)  that  a  partitioning  operation  amounts  to  an  all-pairs 
shortest  path  computation.  Therefore,  the  complexity  of  computing  the  partition  is 
dominated  by  the  complexity  of  solving  Problem  2.4.7.  It  follows  that  at  each  level 
of  the  tree,  the  total  complexity  of  the  computation  is  dominated  by  the  complexity 
of  solving  Problem  2.4.7  on  G.  ■ 
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Theorem  2.4.27  The  complexity  of  the  zero-cycle  detection  algorithm  for  a  graph 
G  =  ( V,E,f )  (where  f  is  d- dimensional)  is  essentially  dominated  by  3 d  applications 
of  the  parametric  minimum  cycle  algorithm  of  Chapter  8,  applied  to  instances  with 
d  —  1  parameters  which  involve  the  graph  G. 

Proof:  The  proof  follows  from  Propositions  2.4.9  and  2.4.26.  | 


2.5  Geometric  lemmas 

In  this  section  we  give  the  necessary  lemmas  which  establish  the  proof  of  Theo¬ 
rem  2.4.6.  The  reader  is  referred  to  [31]  for  background. 

For  any  subset  C  of  Rd,  denote 

C+  =  {v:  (Vu  €  C)(»ru  >  0)}  . 

Recall  that  a  cone  which  does  not  contain  a  nontrivial  linear  space  is  said  to  be 
pointed. 

The  following  proposition  states  well  known  facts  about  cones  [30] . 

Proposition  2.5.1 

i.  Every  cone  C  is  a  direct  sum,  C  =  L  ©  of  a  linear  subspace  L  (the  lineality 
space  of  C)  and  a  pointed  cone  Cp. 

ii.  The  cone  Cv  is  contained  in  the  orthogonal  complement  of  L  in  linC. 
in.  dim(Cp)  =  dim(C)  -  dim(I). 

Proposition  2.5.2 

«.  If  L  C  R?  is  a  linear  subspace,  then  L+  =  L1. 

ii.  For  every  cone  C  we  have  C+  =  C+  fl  Lx ,  where  C  =  L  ©  Cp  as  above. 


2.5.  GEOMETRIC  LEMMAS 


29 


Proof:  The  proof  of  part  i  follows  from  the  fact  that  if  X  is  a  linear  subspace  and 
y  €  L+,  then  yTd  =  0  for  all  d  £  L.  Part  ii  follows  from  the  equality  C~  =  Cf  PL~ 
and  from  part  i.  | 

Proposition  2.5.3  If  C  is  a  pointed  cone,  C+  is  of  full  dimension. 

Proof:  The  following  claim  is  a  consequence  of  the  duality  theorem  of  linear  pro¬ 
gramming.  For  any  finite  set  of  vectors  u1, . . . ,  ur,  if  there  does  not  exist  a  vector 
a  =  (c*i, . . . ,  Or)T  >  0,  a  7^  0,  such  that  £  atu'  =  0,  then  there  exists  a  vector 
v  such  that  vTul  >  1,  i  =  1, . . .  ,r.  Thus,  if  C  is  a  pointed  cone  (not  necessarily 
polyhedral),  there  exists  a  vector  v  such  that  for  every  unit-vector  u  €  C,  vTu  >  1. 
It  follows  that  v  €  C+  and  there  exists  a  ball  B,  centered  at  r,  such  that  for  every 
w  £  B  and  u  £  C  (u  ^  0)  we  have  wTu  >  0.  This  implies  that  B  C  C+.  | 

Proposition  2.5.4 

dim(C+)  =  dim(X1)  . 

Proof:  It  follows  from  Proposition  2.5.3  that  C+  is  of  full  dimension  in  the  space 
lin  Cp.  Recall  that  lin  Cp  C  L+ .  The  proof  follows  from  Proposition  2.5.2  part  ii.  | 

Proposition  2.5.5  If  A  €  relint(C7+),  then  for  all  c  €  Cp  (c  /  0),  A Tc  >  0. 

Proof:  From  the  proof  of  Proposition  2.5.3  and  Proposition  2.5.4  it  follows  that 
there  exists  a  vector  v  such  that  for  every  unit-vector  u  €  Cp,  vTu  >  1,  and  for 
every  to  €  X,  vTw  =  0.  The  set  C+  is  full  dimensional  relative  to  L1.  Therefore, 
if  A Tc  =  0  for  some  c  €  Cp  (c  ^  0),  then  A  0  relint  C+.  i 

Let  C  =  CIRC(G!)  (see  Definition  2.4.4).  Let  X  and  Cp  be  as  in  Proposition  2.5.1. 
Let  K  be  as  in  Definition  2.4.2. 


Proposition  2.5.6 


K  =  C+ 
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Proof:  Tmmediate  from  Proposition  2.4.5.  | 


Proposition  2.5.7  For  every  A  £  relint(/C),  the  set  ORTH(G,A)  is  equal  to  the 
linear  subspace  L. 


Proof:  It  follows  from  Propositions  2.5.5  and  2.5.6,  that  if  A  €  relint  K,  and  c  6  C 
are  orthogonal,  then  c  €  L.  On  the  other  hand,  since  K,  C  Lx  (see  Proposi¬ 
tion  2.5.2),  if  c  €  L  and  A  6  /C,  then  A Tc  =  0.  | 


2.6  Applications  of  the  zero- cycle  detection  algo¬ 
rithm 

We  first  introduce  some  notation  for  the  discussion  of  periodic  graphs  (see  Section  2.1). 
For  a  given  G  =  (V,  E,  f )  where  f  :  E  —>  Zd  and  V  =  {1, . . .  ,n},  denote  by  G"  = 
( V~,E" )  the  infinite  periodic  graph  that  is  defined  by  G  as  explained  in  Section  2.1. 
We  refer  to  G"  as  a  d-dimensional  periodic  graph.  Formally, 

V'=ZdxV  =  {(*,.)  :  z€Z‘,i<=V}-, 

E-  =  Zd  x  E  =  {(*,.)  :  *  €  Z*t  e  €  25}  . 

If  e  =  (*,  j)  we  also  identify  the  edge  (z,  e)  with  the  pair  ((z,i),  (z  +  /(e),  jf)). 

The  zero-cycle  detection  algorithm  of  Section  2.4  computes  the  decomposition 
tree  of  an  input  graph  G  =  (V,  E,  f)  and  the  separating  vectors  for  all  the  subgraphs 
sitting  at  the  nodes  of  this  tree.  Recall  that  this  computation  can  be  performed 
by  solving  polynomially  many  LP  programs.  In  Section  2.4  we  presented  strongly 
polynomial  time  solution  when  the  dimension  d  is  fixed.  We  discuss  two  problems 
which  can  be  solved  easily  when  the  decomposition  tree  and  the  separating  vectors 
are  given. 
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2.6.1  Scheduling 

The  first  application  is  the  problem  of  scheduling  a  system  of  uniform  recurrence 
equations.  The  problem  was  raised  by  Karp,  Miller  and  Winograd  [39]  and  algorithms 
that  solve  it  were  given  in  [39,  56,  55].  These  algorithms  are  stated  in  terms  of  solving 
systems  of  linear  inequalities  and  therefore  do  not  establish  strong  polynomiality. 
We  show  that  the  knowledge  of  the  decomposition  tree  and  the  separating  vectors 
enables  us  to  produce  an  immediate  solution.  Hence,  we  obtain  strongly  polynomial 
complexity  bounds. 

A  system  of  uniform  recurrence  equations  is  a  finite  set  of  relations  among  func¬ 
tions  F{  :  Zd  — ♦  R  (i  =  1, . . . ,  n), 

Fi(z)  =  UFi(z  -  Cil), . . . ,  Fn(z  -  cin ))  . 

(The  definition  can  be  easily  extended  to  accommodate  the  case  where  the  value  of 
some  Fi  is  related  directly  to  more  than  one  value  of  some  Fj.)  Such  a  system  can  be 
modeled  by  a  finite  graph  G  =  ( V,E,f ),  where  the  functions  Fi  correspond  to  the 
vertices.  It  is  called  the  “dependence  graph”  in  [39].  This  graph  defines  a  periodic 
graph  Gm  whose  vertices  correspond  to  the  function  values  jF;(z),  (i  =  1, . . .  ,n,  z  € 
Zd,  z  >  0).  The  direct  dependencies  among  function  values  are  modeled  by  the  edges 
of  GM  as  follows.  If  e  =  (i,j)  €  E  and  /(e)  =  a,  then  the  evaluation  of  F<(z)  requires 
the  knowledge  of  Fj(z  —  a)  (and  having  all  the  required  knowledge  is  sufficient).  For 
simplicity,  suppose  it  takes  one  time  unit  to  evaluate  the  V't’s>  that  is,  given  all  the 
required  knowledge,  it  takes  one  time  unit  to  calculate  the  function  value. 

For  “efficient”  parallel  evaluation  of  the  function  values,  one  would  like  to  find 
large  sets  of  “independent”  values,  that  is,  sets  of  values  that  can  be  computed  simul¬ 
taneously.  Here,  two  values  are  independent  if  there  is  no  directed  path  in  Gm  between 
their  corresponding  vertices.  A  set  of  values  is  called  independent  if  every  two  mem¬ 
bers  of  the  set  are  independent.  The  problem  of  finding  a  maximal  independent  set  of 
values  is  not  easy,  since  the  problem  of  deciding  whether  there  exists  a  directed  path 
in  G‘,  from  (z1,*1)  to  (z2,tJ)  is  MV- Complete,  even  for  one-dimensional  periodic 
graphs  (see  [51]). 
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A  subspace  Si  C  li  is  said  to  be  independent  if  the  values  F^(z)  (z  c  Si)  are  in¬ 
dependent.  Interestingly,  one  can  compute  in  polynomial  time  maximal  independent 
subspaces  [39,  56].  Let  Vj  =  Zd  x  {i},  i  =  1, . . .  ,n.  A  maximal  independent  subspace 
Si  gives  a  partition  of  Vi  into  independent  “isomorphic”  flats  Sv  (t>  €  Sf),  where 
S.  =  (*,0)  +  5i  =  {(v  +  z,i)  |  z  6  Si}. 

The  algorithm  for  maximal  independent  subspaces  finds  for  each  i,  i  =  1, . . . ,  n,  a 
matrix  Mi  of  dimensions  (d  —  dim(Si))  x  d,  whose  rows  are  linearly  independent,  and 
whose  null  space  is  Si.  Following  [56],  the  matrix  Mi  is  called  the  scheduling  matrix 
of  i. 

An  algorithm  that  computes  the  scheduling  matrix  for  the  special  case  where 
the  decomposition  tree  of  G  is  of  depth  one  was  given  in  [39].  In  this  special  case, 
assuming  that  G  is  strongly  connected,  the  scheduling  matrix  would  be  the  same  for 
all  i.  In  fact,  M  —  M\  =  •  •  •  =  Mn  consists  of  a  single  vector  v  £  i?*,  which  is 
computed  by  solving  a  set  of  linear  programming  problems.  Obviously,  the  null  space 
of  v  is  of  dimension  d  —  1.  Any  solution  of  the  set  of  linear  programs  used  in  [39]  is 
in  the  interior  of  the  set  {r  |  (Vc  £  CIRC)(rrc  >  0)}.  Observe  that  every  such  v 
is  a  separating  vector  (see  Definition  2.4.2)  for  G.  Moreover,  it  is  a  witness  since  the 
decomposition  tree  has  depth  one.  A  more  formal  statement  follows. 

Proposition  2.6.1  Suppose  G  =  ( V,E,f )  is  strongly  connected  and  G~  has  no  cy¬ 
cles.  If  the  decomposition  tree  of  G  is  of  depth  one  and  v  is  a  separating  vector  (and 
hence  a  witness)  for  G,  then  the  null  space  of  v  is  a  maximal  independent  subspace. 

Proof:  The  null  space  of  v  has  dimension  d  —  1.  Therefore,  if  it  is  independent,  it 
must  be  maximal.  It  remains  to  show  that  the  null  space  of  v  is  independent.  First, 
we  claim  that  for  any  subspace  S,  if  S  n  CIRC  =  {0},  then  for  all  t  (t  =  1, . . .  ,n) 
and  for  all  u  £  Sx,  the  set  {(tt  +  z,i)  \  z  £  S}  is  independent.  To  prove  this 
claim,  assume  to  the  contrary  that  for  some  6  0  in  5  and  some  t,  there  exists 

a  directed  path  in  G’  from  («,»)  to  (u  +  6,*).  Thus,  there  is  a  cycle  in  G  with 
vector  weight  b,  which  implies  b  €  CIRC,  and  hence  a  contradiction.  Second,  we 
claim  that  the  intersection  of  the  null  space  of  any  witness  v  with  CIRC  is  equal  to 
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{0}.  To  prove  the  second  claim,  observe  that  if  witness  exists,  then  the  cone  CIRC 
of  possible  circulation  values  is  pointed.  Therefore,  since  ORTH(G)  is  the  lineality 
space  of  CIRC(G)  (see  Theorem  2.4.6),  we  have  dim(0RTH(G))  =  0.  Observe  that 
ORTH(G)  is  the  intersection  of  the  null  space  of  any  separating  vector  with  CIRC 
(see  Definition  2.4.4).  Assuming  the  second  claim  holds,  the  first  claim  implies  that 
S(v)  is  an  independent  subspace.  This  concludes  the  proof  of  the  proposition.  | 

Roychowdhury  and  Kailath  [56]  generalized  the  result  of  [39]  and  gave  an  algo¬ 
rithm  which  computes  the  scheduling  matrices  for  any  dependence  graph  G,  where  the 
decomposition  tree  is  not  necessarily  of  depth  one.  In  the  general  case,  the  scheduling 
matrices  Mi  (i  =  1, . . .  ,n)  need  not  be  all  identical,  or  even  of  the  same  dimension. 
Their  algorithm  first  computes  the  decomposition  tree  of  G,  along  with  the  separating 
vectors  of  the  subgraphs  sitting  at  the  nodes  of  the  decomposition  tree.  Subsequently, 
the  algorithm  uses  these  separating  vectors  to  construct  the  scheduling  matrices.  The 
latter  construction  is  trivial  (see  Definition  2.6.2  and  Proposition  2.6.3). 

The  algorithm  of  Roychowdhury  and  Kailath  [56]  (like  the  algorithm  for  the  depth 
one  case  of  [39])  is  based  on  solving  0(m)  sets  of  linear  inequalities  and  therefore, 
does  not  establish  strong  polynomiality.  Recall  that  the  zero-cycle  detection  algorithm 
computes  the  decomposition  tree  of  G  along  with  a  collection  of  separating  vectors 
that  correspond  to  the  subgraphs  of  G  sitting  at  the  nodes  of  the  decomposition  tree. 
Hence,  the  results  obtained  here  imply  that  the  scheduling  matrices  can  be  computed 
within  the  time  bounds  of  the  zero-cycle  detection  algorithm,  that  is,  in  A fC  and 
strongly  polynomial  time. 

When  the  decomposition  tree  of  G  and  the  separating  vectors  at  its  nodes  are 
given,  it  is  easy  to  compute  the  scheduling  matrices  [56]: 

Definition  2.6.2  Let  G  =  (V,  E ,  f)  be  a  dependence  graph,  where  V  =  {1, . . .  ,n}. 
Consider  the  decomposition  tree  of  G,  and  the  separating  vectors  of  the  subgraphs 
sitting  at  the  nodes  of  the  tree.  For  each  vertex  i  €  V”,  consider  the  set  of  subgraphs 
that  are  sitting  in  the  decomposition  tree  and  of  which  t  is  a  member.  This  set  of 
subgraphs  corresponds  to  a  path  in  the  decomposition  tree.  Define  the  path  of  a 
vertex  i  to  be  the  ordered  set  of  subgraphs  along  this  path. 
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Proposition  Au.3  For  a  given  dependence  graph  G  =  (I  r.E,f),  the  scheduling  ma¬ 
trix  Mi  of  a  vertex  i  is  the  matrix  whose  rows  are  the  separating  vectors  of  the  sub¬ 
graphs  along  the  path  of  i. 


2.6.2  Strong  connectivity 

Another  application  of  the  zero-cycle  detection  algorithm  is  the  following.  Given  a 
dependence  graph  G  ~  (V,E,  /),  compute  the  strongly  connected  components  of  G’, 
that  is,  find  graphs  Gx  =  ( V^E^fi )  such  that  the  graphs  G’  are  isomorphic  to  each 
of  the  strongly  connected  components  of  G".  The  problem  of  strong  connectivity  on 
periodic  graphs  was  first  raised  by  Orlin  [51].  However,  his  paper  is  concerned  only 
with  one- dimensional  periodic  graphs  (i.e.,  when  /  :  E  — »  Z  is  a  scalar  function). 
Orlin  gave  an  algorithm  for  the  one-dimensional  case  that  does  not  seem  to  generalize 
to  higher  dimensions.  This  is  not  surprising  since  the  strong  connectivity  problem  is 
obviously  at  least  as  hard  as  zero-cycle  detection  in  G :  the  periodic  graph  G~  does 
not  have  a  cycle  if  and  only  if  it  has  no  nontrivial  strongly  connected  components. 
We  show  that  the  zero-cycle  detection  algorithm  can  be  used  to  compute  the  strongly 
connected  components  of  Gm.  More  specifically,  we  prove  that  the  strongly  connected 
components  are  the  connected  components  of  the  subgraphs  sitting  at  the  leaves  of 
the  decomposition  tree. 

The  relation  between  the  decomposition  tree  of  the  dependence  graph  G  and  the 
strongly  connected  components  of  G‘  is  given  by  the  following  proposition: 

Proposition  2.6.4  The  strongly  connected  components  of  G"  are  precisely  the  con¬ 
nected  components  of  the  graphs  G‘,  where  Gi  is  any  subgraph  of  G  sitting  at  a  leaf 
of  the  decomposition  tree  of  G. 

Proof:  The  proof  is  immediate  from  the  following  two  claims: 

The  first  claim  is  that  every  strongly  connected  component  of  G"  must  be  contained 
in  some  G J.  This  is  obvious  since  all  zero-cycles  of  G  must  be  contained  in  one  of 
the  Gi’e. 
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The  second  claim  is  that  every  connected  component  of  a  G”  is  strongly  connected. 
It  suffices  to  show  that  every  path  (fii,  e2, . . . ,  et)  in  G,  is  part  of  a  zero-cycle.  Since 
every  e,  (j  =  1, . . . ,  l)  participates  in  a  zero-cycle  C{ ,  the  cycle  (jj=1  C;  is  a  zero-cycle 
which  contains  the  path,  i 

Remark  2.6.5  A  strongly  connected  component  S  C  V"  of  G "  is  such  that  if 
(a,i),  (b,  i)  E  S,  then  (a  -I-  a(a  -  b),i)  €  S  for  any  integer  a. 

It  follows  from  Proposition  2.6.4  that  for  a  given  dependence  graph  G  =  (V,  E,  /), 
we  can  compute  a  collection  of  dependence  graphs  Gj  (t  =  1, . . .  ,r),  such  that  the 
graphs  ( Gi)~  are  isomorphic  to  the  strongly  connected  components  of  G".  This  is 
done  as  follows. 

Algorithm  2.6.6  [Strongly  connected  components  of  G’] 

i.  Compute  the  decomposition  tree  of  G.  Denote  by  Gi  (i  =  the  sub¬ 

graphs  sitting  at  the  leaves  of  the  decomposition  tree. 

ii.  For  each  Gi,  compute  a  dependence  graph  Gi,  such  that  (G,)'  is  isomorphic  to 
each  of  the  the  connected  components  of  G'. 

Step  i  of  the  algorithm  involves  the  computation  of  the  decomposition  tree,  that  is, 
the  zero-cycle  detection  algorithm.  Step  ii  involves  the  computation  of  the  connected 
components  of  the  G'’s.  An  algorithm  for  computing  connected  components  of  a 
periodic  graph  is  given  in  [8,  12]. 


2.7  Concluding  remarks 

The  obvious  open  question  that  arises  is  whether  Problem  2.1.1,  where  the  dimension 
d  is  part  of  the  input,  can  be  solved  in  strongly  polynomial  time,  and  whether  it  is 
in  the  class  MC.  It  is  interesting  to  note  the  following: 
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Proposition  2.7.1  The  problem  of  detecting  a  zero  cycle  (Problem  2.1.1)  is  V- 
complete,  and  also  the  general  linear  programming  problem  is  reducible  to  it  in  strongly 
polynomial  time. 

Proof:  The  general  linear  programming  problem  is  equivalent  (in  strongly  polyno¬ 
mial  time  and  an  MC  reduction)  to  the  problem  of  solving  the  following  system: 

Ax  =  0 

( S ) 

0  #  x  >  0  , 

where  A  €  i2mxn.  Consider  a  network  consisting  of  n  parallel  edges  from  vertex  s 
to  vertex  t  and  one  edge  from  t  to  s.  (It  is  a  trivial  matter  to  avoid  parallel  edges 
if  this  is  desired.)  Associate  with  the  i’th  edge  the  weight-vector  given  by  the  t’th 
column  of  A,  said  associate  with  the  reverse  edge  the  zero  vector.  The  existence 
of  a  nontrivial  zero  circulation  in  this  network  is  equivalent  to  the  existence  of  a 
solution  to  the  given  system  (5).  This  establishes  our  claim.  | 

In  view  of  Proposition  2.7.1,  the  questions  stated  in  the  beginning  of  this  section 
are  equivalent  to  two  famous  and  difficult  open  questions. 

Recall  that  we  considered  zero-cycle  which  were  not  necessarily  simple.  Unfortu¬ 
nately,  if  simplicity  of  the  cycle  is  added  to  the  requirements,  the  problem  becomes 
A/’P-complete.  Moreover,  even  the  problem  of  recognizing  whether  a  graph  with 
scalar  weights  has  a  simple  cycle  with  a  total  weight  of  zero  is  A/’P-complete.  This 
follows  Rom  the  fact  that  the  knapsack  problem  can  be  reduced  to  detecting  a  simple 
zero-cycle  in  a  graph  whose  edges  form  a  ring,  where  two  consecutive  vertices  axe 
connected  with  two  parallel  edges. 


Chapter  3 


Parametric  minimum  cycle 


This  chapter  is  concerned  with  the  parametric  extensions  of  the  minimum  cycle  and 
the  minimum  cycle-mean  problems.  In  this  problems,  we  consider  graphs  with  edge- 
weights  which  are  linear  functions  of  d  parameters.  The  goal,  roughly,  is  to  find  an 
assignment  of  the  parameters  such  that  the  value  of  the  optimal  cycle  is  maximized. 

Recall  from  Theorem  2.4.27  that  Problem  2.1.1  can  be  solved  within  the  same 
time  bounds  as  these  parametric  extensions.  The  proof  of  Proposition  2.4.9,  which 
establishes  this  relationship,  was  deferred  to  this  chapter. 

The  algorithms  presented  in  this  chapter  to  solve  the  parametric  minimum  cy¬ 
cle  problem  introduce  a  general  method.  This  method  is  applicable  to  parametric 
extensions  of  a  large  class  of  problems.  More  specifically,  this  method  introduces  a 
combinatorial  way  to  optimize  a  concave  function  in  fixed  dimension,  when  we  are 
given  a  piecewise  affine  algorithm  (see  Definition  3.1.2)  that  computes  this  function. 
The  latter  is  discussed  further  in  Chapter  4. 

This  chapter  is  organized  as  follows.  In  Section  3.1  we  give  some  definitions 
and  describe  the  general  setup.  We  also  give  the  proof  of  Proposition  2.4.9.  In 
Section  3.2  we  present  a  simplified  algorithm  for  the  parametric  minimum  cycle- 
mean  problem.  The  goal  of  this  presentation  is  to  give  the  reader  a  sense  of  how  the 
strongly  polynomial  time  bounds  are  achieved.  Details  which  axe  not  essential  for 
the  qualitative  result  of  strongly  polynomial  time  bounds  are  avoided.  The  reader 
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may  skip  Section  3.2,  since  the  succeeding  sections  are  independent.  In  Section  3.3  we 
give  an  algorithm  for  the  problem  of  parametric  minimum  cycle  with  at  most  n  edges. 
This  section  introduces  additional  ideas  whose  purpose  is  to  improve  the  sequential 
and  parallel  time  bounds. 


3.1  P  r  eliminar  ies 

We  start  by  giving  some  definitions  and  notations. 

Definition  3.1.1 

i.  For  a  finite  set  C  C  Rd,  denote  by  Lc  ■  K1  —*  R  the  lower  envelope  of  the 
linear  functions  that  correspond  to  the  vectors  in  C,  Lc(A)  =  minCec  crA.  The 
vectors  c  €  C,  and  interchangeably  the  linear  functions  cT A  are  referred  to  as 
pieces  of  Lc.  If  for  A'  £  IP1  and  c  €  C  we  have  crA'  =  Lc(A')  we  say  that  c  is 
active  at  A'. 

ii.  Suppose  H  C  R?  is  a  flat,  F  C  IP1  is  the  subspace  parallel  to  it,  and  C  — 
{ci, . . .  ,c,}  C  IP*  is  a  set  of  vectors.  A  balancing  combination  of  C  relative  to 
H  is  a  positive  linear  combination  a»c»  which  is  orthogonal  to  F.  If  H  =  IP* 
we  say  that  o^a  =  0  is  a  balancing  combination  of  C  if  oi, . . . ,  a,  >  0. 

Let  R$  denote  the  set  of  vectors  A  =  (Ai,  •  •  • ,  Aj)T  €  IP1  such  that  A*  =  6. 

Definition  3.1.2  For  g  :  R*  —*  R  (g  :  H  —*  R  where  H  C  R?  is  a  hyperplane),  we 

introduce  the  following  definitions  and  notations: 

i.  Denote  by  Ag  =  A  (possibly  A  =  0)  the  set  of  vectors  A  €  IP1  (A  €  H)  where 
<jr(A)  is  maximized. 

ii.  Denote  by  Kt  =  K  the  set  of  A  €  R1  (A  €  H)  such  that  g(A)  >  0.  Also,  denote 
by  Kf  the  set  of  A  €  K  fl  R$. 
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iii.  An  algorithm  that  computes  the  function  g  is  called  piecewise  affine,  if  the  op¬ 
erations  it  performs  on  intermediate  values  that  depend  on  the  input  vector  are 
restricted  to  additions,  multiplications  by  constants,  comparisons,  and  making 
copies. 

iv.  When  g  =  Lc  (C  C  Rd),  we  say  that  g'  =  Lc>  is  a  weak  approximation  of  g,  if  the 
pieces  of  g'  comprise  a  subset  of  the  pieces  of  g  (C1  C  C )  and  aff  Ag  =  aff  Ap<. 
The  function  g'  =  Lc  is  a  minimal  weak  approximation  of  g ,  if  there  is  no 
proper  subset  C"  of  C'  such  that  Lc  is  a  weak  approximation  of  g. 

We  assume  throughout  that  the  function  g  :  Rd~l  —*  R  is  concave  (sometimes  we 
denote  the  range  by  Rd).  We  also  assume  that  g  is  given  by  a  piecewise  affine  al¬ 
gorithm  A  that  evaluates  it  at  a  given  point.  The  parametric  minimum  cycle  and 
parametric  minimum  cycle- mean  problems  are  special  cases  of  the  following  problem. 
The  scheme  presented  here  to  solve  Problem  3.1.3  for  these  cases,  however,  can  be 
applied  to  any  concave  function  g  as  above.  This  is  discussed  further  in  Chapter  4. 

Problem  3.1.3  F  ^(A)  >  0  for  some  A,  then  output  such  A;  otherwise,  find  A  € 
relint  A.  We  sometimes  add  the  following  requirement.  If  g  <  0,  find  a  subset  C  of  the 
pieces  of  g,  such  that  Lc  is  a  minimal  weak  approximation  of  g,  and  find  a  balancing 
combination  of  C  relative  to  R[.  We  refer  to  this  last  task  as  the  “optional”  part  of 
the  problem. 

Intuitively,  the  set  C  certifies  that  the  function  g  does  not  exceed  its  maximum 
value.  The  advantage  of  considering  it  is  that  while  the  number  of  pieces  of  g  may 
be  vary  large,  the  size  of  a  minimal  weak  approximation  is  at  most  2d.  1 

We  discuss  Problem  3.1.3  where  the  function  g  results  from  parametric  extensions 
of  the  minimum  cycle  and  the  minimum  cycle-mean  problems.  In  an  instance  of 
a  parametric  problem  with  d  —  1  parameters,  edge  weights  are  generalized  to  be 
linear  functions  of  the  parameters.  For  each  assignment  of  values  to  the  parameters 
(vectors  in  J2f),  we  get  an  instance  of  the  nonparametric  problem.  The  function  g  is 
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defined  as  the  mapping  from  assignments  of  values  to  the  solution  of  the  corresponding 
nonparametric  problem. 

Let  G  ~  ( \’,E,f )  be  as  in  Problem  2.1.1.  The  vector-weights  /(e)  (e  €  E)  are 
interpreted  as  linear  functions  of  d  —  1  variables  (parameters): 

/(e)  =  Me) A,  +  •  •  •  +  U-  1(e)Arf.a  -  fd{e)  . 

When  A  is  assigned  with  specific  numerical  values,  we  denote  by  fT A  the  resulting 
set  of  scalar  weights. 

Parametric  minimum  cycle-mean  : 

Definition  3.1.4  Consider  G  =  ( V,E,f ),  where  for  (i,j)  £  E ,  f(i,j )  =  Cij  €  Rd, 

i.  For  a  subset  of  edges  E'  C  E,  denote  by  f(E')  the  ( d  —  Invariable  linear 
function  (l/|£'|)£e€£,/(e). 

ii.  Denote  by  g(\)  the  minimum  cycle-mean1  relative  to  the  scalar-weights  A T c^. 
Problem  3.1.5  [Parametric  Minimum  Cycle-Mean] 

For  a  given  graph  G  =  ( V,E,f ),  if  g( A)  >  0  for  some  A,  then  output  one  such  A; 
otherwise,  find  A  €  relint  A. 

Parametric  minimum  cycle  : 

Definition  3.1.6  Consider  G  =  (V,E,  /),  where  for  e  £  E,  /(e)  =  ce  £  R?, 

i.  For  E'  C  E,  denote  by  f(E')  the  ( d  —  Invariable  linear  function  /(e)- 

ii.  Let  C  =  C( A)  denote  a  cycle  of  at  most  n  edges  which  minimizes  the  total 
scalar  weight  Arc,.  Denote  #(A)  =  f(C)T A. 

1The  minimum  cycle-mean  is  the  minimum,  over  all  simple  cycles,  of  the  total  weight  of  a  cycle 
divided  by  the  number  of  its  edges. 
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Minimum  cycle  of  at  most  2  edges  in  G  Minimum  cycle-mean  in  G 
g{ A)  =  min{|A  -f  2,  A  +  4,  -A  +  4}  flr(A)  =  min{iA  +  2,  -|A  -f  2} 

Figure  3.1:  Example  of  g  for  a  graph  G  with  2-dimensional  weights 
Problem  3.1.7  [Parametric  Minimum  Cycle] 

For  a  given  graph  G  =  (V,  E,  / ),  if  $(A)  >  0  for  some  A,  then  output  any  such  A; 
otherwise,  find  A"  €  relint  A  and  a  collection  C  =  {Ci,  •  ■  • ,  Cr}  of  cycles  ,  each  of 
at  most  n  edges,  such  that  j  is  a  minimal  weak  approximation  of  g 

(see  Definition  3.1.2),  along  with  a  balancing  combination  of  the  cycle  values  f(C) 
relative  to  Rf. 

The  function  g  defined  for  both  problems  above  is  of  the  form  g  =  Lc,  where  C  is 
the  collection  of  all  possible  vector  values  of  cycle-means  (Problem  3.1.5)  or  cycles  of 
at  most  n  edges  (Problem  3.1.7).  Also  note  that  g  is  concave  and  computable  by  a 
piecewise  affine  algorithm  (see  Figure  3.1  for  an  example  of  such  functions).  The  par¬ 
allel  of  the  optional  part  in  Problem  3.1.3  is  omitted  in  the  statement  of  Problem  3.1.5. 
In  Problem  3.1.7,  however,  the  optional  part  corresponds  to  the  collection  of  cycles. 
We  explain  the  purpose  of  considering  the  optional  part.  Recall  that  in  Chapter  2 
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the  problem  of  detecting  zero-cycles  was  reduced  to  solving  Problem  2.4.7.  Solving 
the  latter  problem  amounts  to  (i)  computing  a  separating  vector,  which  is  needed 
to  solve  the  decision  problem,  and  (ii)  finding  a  collection  of  cycles,  •which  is  used 
for  computing  an  explicit  zero-cycle  when  one  does  exist.  Proposition  2.4.9  tied  the 
solution  of  Problem  2.4.7  to  solving  instances  of  the  parametric  minimum  cycle  and 
the  parametric  minimum  cycle- mean  problems.  The  proof  of  Proposition  2.4.9  was 
deferred  to  the  current  chapter.  We  will  see  that  in  order  to  compute  a  separating  vec¬ 
tor  (and  hence  decide  existence  of  a  zero-cycle)  it  suffices  to  consider  Problems  3.1.5 
and  3.1.7  -where  the  optional  part  is  omitted. 

The  remainder  of  this  section  is  organized  as  follows.  In  Subsection  3.1.1  we  define 
an  important  subproblem  of  Problem  3.1.3.  In  Subsection  3.1.2  we  discuss  properties 
of  weak  approximations  and  balancing  combinations.  We  suggest  to  skip  Subsec¬ 
tion  3.1.2  in  first  reading.  In  Subsection  3.1.3  we  give  the  proof  of  Proposition  2.4.9, 
and  hence,  completing  the  reduction  of  the  zero-cycle  detection  problem  to  solving 
instances  of  Problems  3.1.5  and  3.1.7.  The  properties  given  in  Subsection  3.1.2  are 
used  for  solving  the  “optional”  part  of  Problem  3.1.3,  that  is,  to  solve  the  second  part 
of  Problem  2.4.7. 


3.1.1  The  oracle  problem 

We  find  the  following  subproblem  useful  for  the  solution  of  Problem  3.1.3.  The  goal  of 
Problem  3.1.3  is  to  maximize  a  function  over  some  domain.  Intuitively,  the  following 
is  a  useful  tool:  decide  on  which  side  of  a  given  query  hyperplane  (in  the  A  space) 
g(X)  is  either  maximized  or  unbounded.  We  refer  to  a  procedure  that  solves  this 
problem  as  an  orach.  Clearly,  such  an  “oracle”  would  enable  us  to  perform  binary 
searches  over  the  A  space.  In  order  to  achieve  strongly  polynomial  bounds,  however, 
we  use  a  more  sophisticated  approach  where  the  number  of  hyperplane  queries  needed 
is  of  the  order  of  the  number  of  comparisons  done  by  A.  Each  hyperplane  query  can 
be  resolved  by  an  oracle  call.  By  using  the  multi-dimensional  search  techniques  and 
exploiting  the  parallelism  of  A,  however,  we  can  do  better:  We  present  a  solution 
where  the  number  of  oracle  calls  performed  is  a  polylog  in  the  number  of  comparisons 
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(hyperplane  queries  needed). 

The  function  g  is  a  concave  piecewise  linear  mapping  from  Ri  into  R.  Concave 
functions  have  the  property  that  it  can  be  effectively  decided  which  side  of  a  given 
hvperplane  H  contains  the  maximum  of  the  function.  The  decision  can  be  made  by 
considering  a  neighborhood  of  the  maximum  of  the  function  relative  to  H,  searching 
for  a  direction  of  ascent  from  that  point.  This  principle  is  explained  in  detail  in  [47] 
and  is  developed  below  for  the  special  structure  of  our  problem. 

Problem  3.1.8  Given  is  a  concave  function  g  :  Rf  —*  R  and  a  hyperplane  H  in  the 
A-space. 

i.  Recognize  whether  there  exists  A  €  H  such  that  g{ A)  >  0,  and  if  so,  output 
any  such  A;  otherwise, 

ii.  find  A  €  H  D  relint(A),  if  such  a  A  exists,  and  generate  a  solution  of  Prob¬ 
lem  3.1.3  for  g\  otherwise,  if  H  D  relint  A  =  0, 

iii.  recognize  which  of  the  two  halfspaces  determined  by  H  either  intersects  relint  A, 
or  has  g  unbounded  on  it. 

We  refer  to  a  procedure  that  solves  Problem  3.1.8  as  an  oracle  and  to  the  hyperplane 
H  as  the  query  hyperplane. 

The  method  presented  here  solves  instances  of  Problem  3.1.3  by  running  a  “simula¬ 
tion”  of  the  algorithm  A,  where  (i)  additions  and  multiplication  are  replaced  by  vector 
operations,  and  (ii)  comparisons  are  replaced  by  hyperplane  queries.  Problem  3.1.8  is 
solved  by  three  recursive  calls  to  instances  of  Problem  3.1.3  on  functions  of  the  form 
g' :  jR?-1  — ♦  R.  For  a  point  A  €.  R$,  define  g^  =  Lc  as  the  function  whose  pieces  are 
all  the  pieces  of  g  =  Lc  which  are  active  at  A  €  R$  {C  =  {c  €  C  |  cTA  =  p(A)}). 
The  functions  to  which  the  recursive  calls  are  made  axe  restrictions  of  functions  of 
the  form  g ^  to  hyperplanes.  In  Chapter  4  we  show  that  a  piecewise  affine  algorithm 
that  computes  g  can  be  converted  to  a  piecewise  affine  algorithm  that  computes  g' 
and  uses  the  same  number  of  operations.  Algorithms  that  solve  Problem  3.1.8  for 
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the  parametric  minimum  cycle  and  the  parametric  minimum  cycle-mean  functions 
are  given  later  in  the  current  chapter. 


3.1.2  Geometric  lemmas 

We  discuss  some  properties  of  weak  approximations  and  balancing  combinations. 

Proposition  3.1.9  Suppose  that  g  =  Lc,  where  C  is  a  finite  set  of  vectors,  A ~  E  Ap, 
and  C'  =  {c  £  C  \  cT A"  =  <7(A)}.  The  function  Lc  is  a  weak  approximation  of  g. 

Proof:  Consider  g  in  a  neighborhood  of  A‘.  Since  C  is  finite,  there  exists  an  open 
neighborhood  N  of  A“  such  that  LC'{A)  =  g( A)  for  every  A  £  N.  Consider  a  vector 
A'  and  denote  by  L  the  open  line  segment  determined  by  A'  and  A".  Consider  a  point 
A"  £  LnN,  and  let  c  £  C'  be  a  piece  of  g  active  at  A".  Since  <?(A")  <  <7(A")  we  have 
<7(A')  <  ^(A").  Suppose  that  A'  afFA.  We  need  to  show  that  Lc{A')  <  g( A"). 
For  every  A  £  L,  g{A)  <  g(A'),  since  otherwise  we  have  A'  £  aff  A.  In  particular 
the  latter  holds  for  A".  It  follows  that  Lc'(A')  <  cTA'  <  g{A").  § 

Corollary  3.1.10  If  Lc  is  a  minimal  weak  approximation  of  g,  we  have  cT  A  =  flf(A) 
for  all  c  EC'  and  A  €  aff  A.  It  follows  that  Alc,  =  aff  A. 

Proposition  3.1.11  If  g  =  Lc  omd  for  some  A  €  relint  A  and  c  £  C  we  have 
cT A  =  g(A),  then  cTA  =  p(A)  for  all  A  £  aff  A. 

The  proof  is  immediate. 

Proposition  3.1.12  Suppose  g  =  L £,  where  C  C  R? ,  is  bounded  from  above.  Let 
C  =  {ci, . . . ,  cv}  C  C  be  a  set  of  pieces  of  g.  Under  these  conditions, 

i.  The  function  Lc  is  a  weak  approximation  of  g  if  and  only  if  con eC  3  (aff  Ap)x. 
If  Lc  is  a  minimal  weak  approximation  of  g,  con eC  =  (aff  A p)x. 
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it.  If  Lc  is  a  minimal  weak  approximation  of  g,  then  there  exist  positive  numbers 
ai, ...  ,OLr  such  that  £'=1  a.Ci  =  0  and  hence  lin C  =  (aff  Afl)~ . 

Hi.  If  ai, ...  ,OLr  as  in  (ii)  are  given  explicitly,  then  every  c  €  (aff  A)~  can  be  ex¬ 
pressed  as  a  nonnegative  linear  combination  of  vectors  in  C  using  O(cPr)  oper¬ 
ations. 

Proof:  The  function  L c-  is  bounded  from  above  on  Rd,  and  hence  is  nonpositive. 
Therefore,  0  6  A,  £c-(  A)  =  0,  and  aff  A  =  lin  A  is  a  subspace.  Part  (i)  follows 
from  Proposition  2.4.10.  The  equality  when  Lc  is  a  minimal  weak  approximation 
follows  from  Corollary  3.1.10.  Parts  (ii)-(iii)  are  direct  consequences  of  Part  (i)  and 
Proposition  2.4.17.  | 

Remark  3.1.13  Suppose  H  =  {A  €  Rd  |  aT X  =  (3}  ((3  >  0)  is  a  hyperplane, 
F  —  {A  €  Rd  |  aTA  =  0}  is  the  subspace  parallel  to  H,  and  A’  €  H  is  a  point  on 
the  hyperplane.  By  applying  standard  methods,  using  0(d ®)  operations  we  can  find 
an  affine  mapping  M  from  H  onto  Rd~ 1  that  maps  A'  to  0.  All  such  mappings  are  of 
the  form  M  :  H  —*  M( A)  =  L( A  —  A")  where  the  matrix  L  €  .ftO*-1)*4*  is  such 
that  L(F)  =  {LA  |  A  £  F}  =  Rd.  Within  the  same  time  bound  we  can.  compute  the 
matrices  L~l  €  which  map  Rf~l  onto  the  subspace  F,  and  F  €  R?*d  such 

that  for  y  €  R d,  Fy  is  the  projection  of  y  into  the  subspace  F. 

Proposition  3.1.14  Let  C  C  R?  be  a  finite  set  of  vectors,  and  suppose  Lc  '■  R?  — ► 
R  is  bounded.  Suppose  we  are  given  a  hyperplane  H  C  R? ,  along  with  a  point 
A*  €  Aic|H.  Under  these  conditions,  in  0(d?)  operations  we  can  compute  an  affine 
mapping  M  :  H  —*  Rf~l  (M( A*)  =  0),  and  a  linear  mapping  of  the  vectors  in 
C'  =  {c  €  C  |  cT A*  =  Lc( A)}  (the  pieces  of  Lc  which  are  “ active ”  at  Am)  into  vectors 
in  C  C  i^-1 .  These  mappings  convert  the  problem  of  computing  a  minimal  weak  ap¬ 
proximation  of  Lc  restricted  to  H  into  an  equivalent  problem  of  computing  a  minimal 
weak  approximation  of  £<*  :  R?"1  — ►  R. 


Proof:  We  use  the  notation  of  Remark  3.1.13.  It  follows  from  Proposition  3.1.9 
that  Lc  is  a  weak  approximation  of  Lc\h,  so  we  can  consider  only  the  vectors  in 


46 


CHAPTER  3.  PARAMETRIC  MINIMUM  CYCLE 


the  set  C' .  Let  K  =  FL~l ,  and  C  =  {c  |  c  €  C'}  where  c  =  cTK.  We  will  show 
that  (i)  the  set  C  is  such  that  for  all  A  €  H,  c  €  C',  cT A  =  crM( A)  -f  Lc( A"),  and 
(ii)  for  Y  C  C,  the  set  Y  C  C  is  such  that  Ly  is  a  minimal  weak  approximation  of 
Lj  if  and  only  if  Ly  is  a  minimal  weak  approximation  of  Lc\h- 

For  a  vector  y  €  Rd  denote  by  y'  =  yTa/ ||a|[  the  projection  of  y  into  the  subspace 
spanned  by  a,  and  denote  by  y  =  Fy  =  y  —  y'  the  projection  of  y  on  F.  Observe 
that  F  =  {&a\  /3  €  J?}x  C  Rd,  and  thus  for  all  x,  y  €  Rd,  xTy  =  xTy  -f  x'y' ,  and 
for  all  A  €  H  we  have  A'  =  (A")'. 

For  c  €  C',  define  c  =  cT L~l  =  cK.  Consider  vectors  c  £  C'  and  A  €  if.  Recall 
that  cT A‘  =  Xc(A'),  and  therefore  we  need  to  show  that  cT( A  —  A")  =  cTM( A). 
Note  that  ct(A  -  A‘)  =  cT( A  -  A")  +  c'(A'  -  (A‘)').  Since  A'  =  (A’)'  for  all  A  €  H, 
we  have  cT( A  —  A")  =  cTL~1L( A  —  A")  =  crM( A).  ■ 

Proposition  3.1.15  Under  the  conditions  of  Proposition  3.1.14  the  following  holds: 
A  set  of  vectors  {c1} . . . ,  c,}  C  C' ,  and  a  set  of  positive  numbers  a\, . . . ,  a,,  >  0 

satisfy  <*iCi  =  0,  if  and  only  if  OjCj  =  —7a  and  7  >  0.  Moreover,  7  >  0  if 
and  only  if  Lc\h  <  0.  The  scalars  a,-  are  independent  of  the  particular  choice  of  A' 
and  the  linear  mappings. 

Proof:  We  continue  to  use  the  notation  of  the  proof  of  Proposition  3.1.14.  Re¬ 
call  that  cL  =  c,  and  thus  (XiCi  =  (Z)i=1  a iC%)L  =  0.  Hence,  a<Cj 
=  £-=1  a,c'i  +  2isl  OLic'i  =  21=  1  c\-  Denote  c\  =  cja.  We  need  to  show  that 

£<=1  ^  0  and  the  inequality  is  strict  if  and  only  if  Lc(A")  <  0.  Recall  that 

for  all  c  €  C,  crA“  =  LC(A-).  Thus,  (^=1  a,c-)rA*  =  (£;=J  Oi^)aTA’  <  0, 
and  equality  holds  if  and  only  if  Lc(Am)  =  0.  To  conclude  the  proof,  observe  that 
A’  €  H ,  and  therefore  aT A*  =  /?  >  0.  1 

Proposition  3.1.16  Suppose  we  are  given  a  set  of  vectors  C  =  {ci,...,c,}  C  if* 
suck  that  C  spans  R*  and  a  balancing  combination  ofC  (see  Definition  3.1.1  part  ii). 
By  using  0(r(P)  operations  we  can  find  a  minimal  subset  C1  C  C  (C  |  <  2d)  that 
spans  R?  and  a  balancing  combination  forC. 


-r— - ^ 


3.1.  PRELIMINARIES 


47 


Proof:  Assume  that  the  vectors  Cj, . . .  .Cj  span  Rd.  In  order  to  find  a  solution  we 
use  an  iterative  step  with  the  following  properties.  If  I C  ^  2d,  we  find  a  balancing 
combination  for  C  \  C  where  C  C  {c^i, . . . ,  c2<f+i}  and  \C  1.  It  is  easy  to  verify 
that  after  repeating  this  step  at  most  r  —  2d  times,  we  get  a  oalancing  combination 
for  a  subset  C1  of  C,  which  spans  Rd  and  is  of  size  at  most  2d. 

The  iterative  step  is  as  follows.  Solve  the  linear  system  of  equalities  Ylitdii  &ici  — 
0.  The  vectors  Cj+i, . . . ,  c2<^i  are  linearly  dependent,  and  hence  this  system  is 
feasible.  Assume  without  loss  of  generality  that  for  at  least  one  index  i,  0 *  >  0.  Let 
h  =  {d  -f  1  <  i  <  2d  j  0i  >  0},  h  —  {d  +  1  <  i  <  2d  |  0i  <  0},  7  =  mini€j,  a*/#;, 
and  C  =  {Ci  \  i  e  li,oti/0i  =  7}.  We  have  '£.ieiluii  &&  =  0,  and  C  ±  0.  Consider 
a'  =  (ai,. . .  ,a')  where  a-  =  a<  (i  £  U  /2),  a-  =  ctj  -  7ft  (i  €  h  U  /2).  It 
is  easy  to  verify  that  a'{  >  0  and  a'{  =  0  if  and  only  if  C{  €  C.  It  follows  that 
Ec,ec-<5  a«c‘  =  £ci€C  <*ici  -  7  HieiiuJj  @ici  =  0.  Thus,  the  vectors  in  C  \  C  span 
R4,  and  Ec,ec-d  aici  a  balancing  combination. 

The  complexity  is  as  follows.  The  first  step  amounts  to  computing  d  independent 
vectors  {cj, . . . ,  c«z},  what  can  be  done  in  0(r<P)  time.  In  each  iterative  step  we 
solve  a  system  of  linear  equalities.  This  can  be  done  in  O(tP)  operations,  using 
Gaussian  eliminations.  Notice,  however,  that  in  two  consecutive  iterations,  the 
linear  systems  have  d  —  \C\  columns  in  common.  In  such  a  case,  when  the  matrix  of 
the  first  problem  is  given  in  upper  triangular  form,  the  second  system  can  be  solved 
in  only  0{\C\dP)  operations.  Thus,  the  total  number  of  operations  is  0(r<P).  % 

Corollary  3.1.17  Suppose  g  —  Ly  where  (Y  C  R?)  is  bounded  from  above.  Assume 
we  are  given  a  set  of  vectors  C  =  {c2, . . . ,  c,}  and  a  balancing  combination  of  C  such 
that  (i)  Lc  is  a  weak  approximation  of  g,  and  (ii)  there  exist  A*  €  relint  A  such  that 
for  all  c  €  C  we  have  cT X"  =  y(A“).  Under  these  conditions,  in  0(rd?)  operations 
we  can  compute  C'  C  C  such  that  Lc  is  a  minimal  weak  approximation  of  g,  and  a 
balancing  combination  ofC'.  The  size  of  such  C'  is  at  most  2 (d  —  dim(A)). 

Proof:  It  follows  from  Corollary  3.1.10  and  Propositions  3.1.11  that  for  all  c  €  C, 
c  €  (aff  A)x.  It  follows  from  Proposition  3.1.12  that  the  vectors  C  span  the  subspace 
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(aff  A)-1.  To  conclude  the  proof,  note  that  the  subspace  (aff  A)-1  is  of  dimension 
d  —  dim(A).  ■ 

Proposition  3.1.18  Suppose  g  =  Lc  has  the  property  that  there  exist  a  vector  A‘  6 
relint  A  such  that  cT A"  =  g{ A)  for  all  c  €.  C.  Suppose  we  are  given: 

i.  Hyperplanes  H(  (6  €  {  —  1,0,1}),  where  H(  =  {A  j  arA  =  a  4-  £}  for  some 
vector  a  and  a  6  R,  and  Ho  contains  A' . 

it.  Sets  of  vectors  Ct  —  {c{, . . .  ,c{Tf}  C  C  and  a 6  €  R+  (6  G  {—1,0, 1})  such  that 
Lc,  is  a  minimal  weak  approximation  of  g\H(,  and  £*Ia  aici  a  balancing 
combination  of  Ct  relative  to  Hf  ( 6  €  {  —  1,0,1}). 

By  using  0(dP  £<€{_li0ii}r<)  operations  we  can  compute  a  set  of  vectors  C  and  a 
balancing  combination  of  C  relative  to  such  that  C  C  an^  L^  is  a 

minimal  weak  approximation  of  g. 

Proof:  We  give  a  description  of  an  algorithm.  In  the  first  step  the  algorithm 
computes  a  set  C'  C  C  of  size  at  most  £*€{_i|0,i}  rs  and  a  balancing  combination  of 
C',  such  that  Lc  is  a  weak  approximation  of  g.  This  is  done  as  follows. 

i.  If  A  C  Ho,  then  C'  =  U«€{o,i,-i}  £«  is  such  that  Lc  is  a  weak  approximation  of  g. 

To  find  a  balancing  combination,  the  algorithm  computes  fit  >  0  (6  €  { — 1, 0, 1}) 
such  that  fit  aici  =  Otherwise, 

ii.  Hf  fl  A  ^  0  (for  either  6  =  1  or  8  =  —  1).  It  turns  out  that  the  set  C'  =  C<  is 
such  that  Lc  is  a  weak  approximation  of  g.  Hence,  the  algorithm  chooses  a6 
as  the  coefficients  of  a  balancing  combination. 

Assuming  that  C'  is  a  weak  approximation  of  g  the  algorithm  proceeds  as  follows. 
In  case  ii,  C  =  C'  is  obviously  a  minimal  weak  approximation  of  g.  In  case  i  it 
follows  from  Corollary  3.1.17  that  by  using  0(iP  £<e{-i,o,i}r<)  operations  we  can 
compute  C  CC  and  a  balancing  combination  for  C,  such  that  Lj  is  a  minimal  weak 
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approximation  of  g. 

What  remains  is  to  prove  that  Lc  is  indeed  a  weak  approximation  of  g.  Denote 
by  A{  C  Hi  the  set  of  maximizers  of  g  restricted  to  He.  By  using  an  appropriate 
translation  of  the  coordinates,  we  can  transform  the  problem  so  that  C  C  Rd~l, 
g  :  — *  R,  He  =  {A  G  i?d-1  |  aT A  —  £},  A”  =  0,  and  g(A)  =  0.  This  transfor¬ 

mation  preserves  the  property  of  a  subset  being  a  weak  approximation  or  having 
a  balancing  combination  relative  to  a  given  flat.  A  balancing  combination  in  the 
transformed  problem  corresponds  to  a  balancing  combination  relative  to  Rd  in  the 
original  one.  Note  that  in  the  transformed  problem,  0  6  A,  so  the  flat  aff  A  is  a 
subspace. 

Let  Af  G  relint  A;.  Observe  that  g  is  linear,  that  is,  for  all  a  >  0,  g(aA)  =  ag(A). 
Thus,  in  case  i  A  =  A0.  In  case  ii  consider  the  intersection  of  A  with  the  open  halfs¬ 
pace  {A  |  £(arA)  >  0}.  The  set  A  is  convex.  Thus,  if  is  not  empty,  it  must  contain 
a  relative  interior  point  of  A.  The  hyperplane  He  is  contained  in  this  halfspace  and 
hence  this  intersection  is  not  empty,  and  dim  A s  =  dim  A  —  1.  Note  that  0  £  aff  A*. 
Thus,  the  flat  aff  A*  0{/3Aj  |  fi  G  i2}  must  be  contained  in  aff  A.  On  the  other  hand 
this  flat  is  of  the  same  dimension  as  aff  A  and  therefore  equality  holds.  Consider 
any  A'  G  Rd~l  such  that  A'  ^  aff  A.  In  order  to  prove  that  Lc  weakly  approximates 
g  we  need  to  show  that  there  exists  a  c  G  C'  such  that  cT A'  <  0. 

First,  we  prove  case  i.  If  a? A1  =  0  then  we  are  done,  since  A  £  Ho  and  Co  C  C'. 
Otherwise,  assume  without  loss  of  generality  that  a? A'  =  &  >  0.  Consider  the 
vector  A  =  A//3.  The  vector  A  lies  on  the  line  determined  by  0  and  A,  and  is  such 
that  aT A  =  1.  Since  aff  A  is  a  flat,  if  it  contains  two  points  on  a  line,  it  contains  the 
whole  line.  Therefore,  0  G  aff  A  and  A'  £  aff  A  imply  that  A  £  aff  A.  It  follows  that 
there  exists  a  c  €  C\  such  that  A  c  <  0.  Finally,  observe  that  A a c  =  (/0A)rc  <  0. 

To  prove  case  ii,  define  A"  to  be  the  vector  A"  =  A'  4-  $A\  where  0  =  (6  — 
aTA')/(aTAms).  Note  that  aTA"  =  6  and  thus  A"  G  H$.  Moreover,  A"  0  aff  he, 
since  otherwise  A'  G  aff  A.  Thus,  there  exists  a  c  G  Cf  such  that  cT A"  <  0.  We 
have  A|  G  aff  A  and  therefore  A,;  has  the  same  projection  on  (aff  A)x  as  A'.  It  follows 
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from  Proposition  3.1.11  that  for  all  c'  £  C{,  c'  €  (aff  A)~.  Hence,  cT A1  =  cT A"  <  0 

To  find  a  balancing  combination,  recall  from  Proposition  3.1.15  that  there  exist 
numbers  71  >  0,7-!  <  0  such  that  £*=!  ol\c\  =  7 fa.  Hence,  we  choose  fit  =  7i-f|, 
fio  =  1.  It  is  easy  to  verify  that 

H  fit  ^2  aici  =  0  . 

<€{-1,0,1}  i= 1 


3.1.3  Back  to  zero-cycles 

We  are  now  ready  to  give  the  deferred  proof  from  Chapter  2. 

Proof  of  Proposition  2.4.9:  Observe  that  the  weight  of  a  minimum  weight  cycle 
in  a  graph  with  the  scalar  weights  a\T  f  (a  >  0)  is  linear  in  a.  If  the  graph  G  has 
a  witness  A  then  either  =  0  or  the  vector  (l/A„t)A  is  a  witness  as  well.  Therefore, 
we  can  restrict  our  search  on  the  A-space  to  vectors  A  E  R?  where  Ad  E  {-1,0, 1). 
Denote 

KS  =  KC  {y:yd  =  S}  (8  E  {-1,0,1})  . 

Run  the  parametric  minimum  cycle  (resp.,  parametric  minimum  cycle-mean)  al¬ 
gorithm  on  G  as  defined  in  Problem  3.1.7  (resp.,  Problem  3.1.5)  three  times  with 
the  following  ( d  —  Invariable  linear  functions.  For  8  =  —1,0,1,  we  associate  with 
the  edges  the  linear  functions  fTAf  where  A*  =  (Ai,...,A 4-1, 6)T.  If  a  witness 
for  G  exists,  it  must  be  found  during  at  least  one  of  the  three  runs.  Otherwise, 
the  maximum  of  the  function  g  in  each  of  the  three  subproblems  is  nonpositive. 
Thus,  for  all  three  instances,  the  corresponding  sets  .  „  [8  6  {—1,0,1})  consist  of 
vectors  for  which  g  =  0.  The  algorithm  computes  nonzero  vectors  A}  €  rel  int  Ks, 
8  €  {—1, 0, 1}  (if  Kf  0),  along  with  collections  (of  size  |C^|  =  0(d))  of  cycles 
such  that  ^  C€C(<)^  is  a  minimal  weak  approximation  of  g  restricted  to  the 

hyperplane  A 4  =  8,  and  a6  are  coefficients  of  balancing  combinations  of  C*  relative 
to  Ad  =  8.  To  find  the  separating  vector  A  we  proceed  as  follows,  (i)  If  Ks  #  0  for 
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6  €  {  —  1, 1},  we  choose  A  to  be  Aj,  and  then 

A  =  A}  €  relint  AC$  C  relint  AC  . 

(ii)  If  K.-i  =  K\  =  0  and  AC0  ^  {0},  we  choose  A  to  be  a  nonzero  vector  A„  € 
relint  AC0,  and  so 

A  =  Aq  6  rel  int  AC0  =  rel  int  AC  . 

(iii)  The  remaining  case  is  AC_i  =  ACj  =  0  and  AC o  =  {0}.  Here  we  conclude  that 
K  =  {0},  so  there  is  no  separating  vector. 

As  done  in  the  proof  of  Proposition  3.1.18,  using  0(d °)  operations,  we  can  find  a  set 
C  of  cycles  and  a  balancing  combination  for  the  cycle  values  {/(C)  |  C  €  C},  such 
that  the  lower  envelope  of  the  set  of  cycle  values  is  a  minimal  weak  approximation 
of  the  function  g.  To  conclude  the  proof  observe  that  the  size  of  a  minimal  weak 
approximation  is  at  most  0(2d)  (see  Corollary  3.1.17),  and  that  coneC  D  ORTH(G) 
(see  Corollary  2.4.11).  | 

3.2  Algorithm  for  parametric  min  cycle-mean 


In  this  subsection  we  sketch  a  strongly  polynomial  time  algorithm  for  the  parametric 
minimum  cycle-mean  problem.  As  mentioned  before,  this  algorithm  is  simpler  than 
the  parametric  minimum  cycle  algorithm.  However,  its  time  bounds  are  worse.  The 
purpose  of  discussing  it  here  is  to  give  the  reader  some  intuition  and  explain  some  of 
the  main  ideas  behind  the  strongly  polynomial  bounds.  Thus,  only  the  ideas  that  are 
essential  for  strongly  polynomial  time  bounds  are  introduced.  Two  simplifications 
are  made.  First,  the  multi-dimensional  search  technique  is  not  discussed.  It  improves 
the  time  complexity,  but  is  not  essential  for  strongly  polynomial  bounds.  The  second 
simplification  is  that  the  algorithm  only  decides  the  existence  of  a  zero-cycle.  There¬ 
fore,  the  collection  C  (see  Problem  2.4.7)  need  not  be  computed,  and  it  suffices  to  be 
able  to  compute  a  witness  or  a  separating  vector. 
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Remark  3.2.1  The  minimum  cycle-mean  relative  to  a  set  of  scalar  weights  can  be 
found  in  0(f\E\  •  |V'|)  time  by  an  algorithm  due  to  Karp  [38].  With  n3  parallel  proces¬ 
sors,  the  minimum  cycle-mean  can  be  computed  in  0(logsn)  time  (see  [45]). 


Proposition  3.2.2  The  value  of  g( A)  can  be  described  as 


g( A)  =  max  min  (— 7r;  +  7r,  —  ArC{.  )  . 


Proof:  Given  A  6  Rf ,  consider  the  following  linear  programming  problem: 
Minimize  y~](Arc^)z^- 

subject  to  =  0  (i  =  1, . . . ,  n) 

(P) 

=  1 

x  >  0  . 

Obviously,  (P)  has  an  optimal  solution.  Now,  every  feasible  solution  x  of  (P)  is  a 
convex  combination  of  feasible  solutions  where  E(x')  is  a  simple  cycle.  Since 
£  x\j  =  1,  the  value  of  the  objective  function  at  each  x'  is  precisely  the  mean  weight 
of  the  cycle  determined  by  x'.  Hence  ( P )  has  an  optimal  solution  of  the  form  x' . 
In  other  words,  the  optimal  value  of  (P)  is  equal  to  g( A).  Consider  the  dual  of  (P): 

Maximize  t 

(D) 

subject  to  iTi  —  7Tj  +  t  <  —A TCij  . 

Our  claim  follows  from  the  fact  that  the  optimal  value  of  (D)  is  equal  to  g( A).  ( 


Corollary  3.2.3  The  problem  of  maximizing  g(  A)  can  be  formulated  as  a  linear  pro • 
gramming  problem: 


Maximize  t 

wtttX 

subject  to  Vi  —  irj  +  XTCij  -f  t  <  0  . 
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The  dual  of  (P)  is  the  following  zero- circulation  problem: 

Minimize  ^  dijXij 

subject  to  —  Xji)  =  0  (i  =  1, . . . ,  n) 

i 

(D)  =  l 

xncn  =  ® 

x  >  0  . 

Where  c{j  =  cij  €  Qd~l  ■ 

Remark  3.2.4  In  view  of  Corollary  3.2.3,  the  problem  of  maximizing  p(A)  can  be 
solved  by  an  extension  of  the  algorithm  for  linear  programming  in  fixed  dimension  [47] 
and  its  improvements  [5,  19].  Note  that  only  the  dimension  of  the  A-space  is  fixed, 
whereas  the  number  of  nfs  varies.  However,  we  will  work  recursively  on  the  space  of 
A’s  where  at  the  base  of  the  recursion  ( d  =  1)  we  have  a  problem  of  the  form  of  (D), 
i.e.,  a  problem  with  fixed  scalar- weights  which  is  equivalent  to  the  usual  minimum 
cycle-mean  problem  (with  no  parameters). 

Before  discussing  the  algorithm  for  Problem  3.1.5  we  present  the  respective  “ora¬ 
cle”  algorithm. 

Problem  3.2.5  Given  G  =  ( V,E,f )  and  a  hyperplane  H  in  the  A-space,  solve 
Problem  3.1.8  for  g  relative  to  H. 

Problem  3.2.5  (an  oracle  call)  is  solved  by  recursive  calls  to  an  algorithm  for 
Problem  3.1.5  on  input  graphs  with  vector  weights  of  a  lower  dimension  as  we  argue 
below. 

Theorem  3.2.6  Problem  8.2.5  can  be  solved  by  three  recursive  calls  to  an  algorithm 
for  Problem  8.1.5  in  dimension  d—\.  The  complexity  of  the  additional  computation 
is  dominated  by  the  calls  to  Problem  8.1.5. 


54 


CHAPTER  3.  PARAMETRIC  MINIMUM  CYCLE 


Proof:  Consider  Problem  3.1.5  subject  to  aT A  =  a  (and  A  6  Rf).  This  is  in 
fact  a  (d  —  l)-dimensional  version  of  the  original  problem  restricted  to  H  =  {A  € 
Rf\aTA  =  a}.  If  g  is  unbounded  on  H,  then  this  fact  is  detected;  otherwise,  suppose 
A^  is  in  the  relative  interior  of  the  set  of  maximizers  of  <?(A)  subject  to  A  6  H. 
Suppose  we  also  have  corresponding  values  . . .  ,7r°  and  =  ^(A^).  We  wish 
to  recognize  whether  A^  is  also  a  relative  interior  point  of  the  set  of  global  maxima 
(i.e.,  relative  to  Rf).  If  not,  we  wish  to  decide  whether  for  all  A'  6  Rf  such  that 
$(*")  >  necessarily  aT  A“  >  a,  or  whether  for  all  of  them  aT A*  <  a;  these 

are  all  the  possible  cases.  Let  E°  denote  the  set  of  edges  (i,j)  such  that 

-»<  +  ir®  -  (A °)TCij  =  t°  . 

Notice  that  the  subgraph  G°  of  G  induced  by  E°  contains  all  the  cycles  of  G  whose 
cycle-mean  is  minimum  relatively  to  the  weights  (A°)rcly,  and  only  such  cycles.  In 
order  to  determine  whether  A0  is  in  the  relative  interior  of  the  global  maxima  (and 
if  not  determine  which  side  of  H  contains  it),  we  can  consider  the  behavior  of  g 
on  two  hyperplanes  close  to  H.  These  hyperplanes  are  parallel  to  H,  one  on  each 
side.  We  consider  the  local  maxima  relative  to  these  planes.  In  order  to  avoid  the 
problem  of  how  close  to  get,  we  can  consider  just  the  pieces  of  g  which  are  active 
at  A0.  This  is  equivalent  to  considering  only  cycles  whose  cycle-mean  is  minimum 
relative  to  (A°)rCy,  namely,  the  cycles  of  the  subgraph  G°.  We  now  solve  two 
problems  on  G°  of  maximizing  t  subject  to 

Ti  -  TTj  +  ArCij  +  t  <  0  €  E°)  , 

where  in  one  of  the  problems  we  also  include  the  constraint  aT A  =  a  —  1,  _iid  in 
the  other  we  include  the  constraint  orA  =  o  +  l.  Both  problems  can  be  solved 
as  (d  —  l)-dimensional  problems  since  one  of  the  Aj’s  can  be  eliminated.  Denote 
the  optimal  values  of  these  problems  by  and  Only  one  of  the  optimal 

values  cam  be  greater  than  t^°\  If  this  is  the  case,  or  if  one  of  1 equals  t 
and  the  other  is  smaller,  then  the  side  of  the  hyperplane  that  contains  relint  A 
is  determined.  Otherwise,  if  either  both  are  less  than,  dr  both  are  equal  to  f(°), 
then  t is  the  global  optimal  value.  The  base  of  the  recursion  is  the  1-dimensional 
problem  (with  no  parameters  at  all).  | 
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A  sketch  of  the  algorithm  for  Problem  3.1.5  is  given  below. 

Algorithm  3.2.7  Consider  a  minimum  cycle-mean  algorithm  that  uses  only  com¬ 
parisons,  additions,  and  divisions  (multiplications)  by  scalars  as  primitive  operations. 
Define  the  corresponding  lifted  algorithm  to  operate  on  edge  weights  that  are  lin¬ 
ear  functions,  defined  by  /(e)  (e  €  E).  The  lifted  algorithm  maintains  a  collection 
H  of  open  halfspaces  which  is  initialized  as  the  empty  set.  Additions  and  divisions 
by  scalars  are  naturally  “lifted”  to  additions  and  scalar  divisions  of  linear  functions. 
Comparisons  are  more  intricate.  To  “compare”  two  linear  functions,  we  compute  the 
hyperplane  H  such  that  the  result  of  a  comparison  is  uniform  throughout  each  of  the 
halfspaces  defines  by  H.  We  then  perform  an  oracle  call  on  H.  If  the  oracle  call  did 
not  result  in  a  global  solution,  a  halfspace  h  is  found  (one  of  the  sides  of  H)  which 
contains  the  maxima  of  g.  The  result  of  the  comparison  is  the  relation  between  the 
linear  functions  that  holds  throughout  h.  The  halfspace  h  is  then  added  to  7i  and 
the  algorithm  continues.  If  the  algorithm  terminates  and  no  oracle  call  resulted  in 
a  solution,  a  separating  vector  (or  a  witness)  is  found  by  considering  a  point  in  the 
intersection  of  all  the  open  halfspaces  in  H. 


3.3  Algorithm  for  parametric  minimum  cycle 

This  section  introduces  an  algorithm  for  the  parametric  minimum  cycle  problem.  This 
problem  is  essentially  an  instance  of  Problem  3.1.3  for  a  particular  family  of  functions. 
In  Chapter  4  we  use  the  general  lines  of  this  algorithm  to  solve  Problem  3.1.3  for  any 
concave  function  g  given  by  a  piecewise  affine  algorithm. 

We  suggest  an  oracle  algorithm  which  relies  on  making  recursive  calls  to  instances 
of  Problem  3.1.7,  where  there  are  fewer  parameters,  but  the  weights  are  more  complex. 

In  order  to  facilitate  the  recursion,  the  problem  is  generalized  to  the  extended 
parametric  minimum  cycle  problem  (EPMC).  The  algorithm  presented  here  solves 
the  EPMC  problem.  Let  us  start  with  a  definition  of  this  problem. 
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Extended  parametric  minimum  cycle.  Let  G  =  ( \',E,w,f )  be  a  graph  with 
two  sets  of  vectors  associated  with  the  edges,  i.e. ,  for  every  e  €  E,  /(e)  €  Rd  and 
w(e)  €  R*.  We  identify  /(e)  with  the  ( d  —  l)-dimensional  linear  function: 

/(e)  =  /l(e)^l  +  •  •  *  +  fd-l(e)^d-l  +  fd(e)  • 

The  weight  of  an  edge  e  is  the  (£  +  l)-tuple  (tui(e), . . .  ,  u>/(e),  /(e)).  The  definitions 
given  in  3.1.6  are  extended  as  follows. 

Definition  3.3.1  Consider  a  graph  G  =  (V,E,w,  /),  where  for  e  €  E,  /(e)  =  ce  € 
Rd. 

i.  A  cycle  C  (not  necessarily  simple)  is  called  w -minimal  if  the  value  w(C)  = 

to(e)  (where  edges  are  counted  as  many  times  as  they  occur  on  the  cycle) 
is  minimal  relative  to  the  lexicographic  order  on  Rl. 

ii.  Let  C  =  C(A)  denote  a  tn-minimal  cycle  of  at  most  n  edges  which  minimizes 
the  weight  £e€C  A Tce.  Denote  g{ A)  =  /(C)rA.  Note  that  the  first  l  coordi¬ 
nates  of  the  vector  weight  of  a  minimum  cycle  (i.e.,  the  values  given  by  to)  are 
independent  of  A;  g(A)  gives  only  the  ((  +  l)st  coordinate. 

Remark  3.3.2  For  a  graph  with  vector  weights  in  Rl,  the  minimum,  relative  to  the 
lexicographic  order,  among  cycles  of  length  less  than  or  equal  to  n  can  be  computed 
in  one  application  of  an  all-pairs  shortest  path  algorithm.  This  takes  0(i\E\  •  |V|) 
time  sequentially,  or  O(log2  n)  time  using  In 3  processors  in  parallel.  Therefore,  the 
function  #(A)  can  be  evaluated  by  a  piecewise  affine  algorithm  (see  Definition  3.1.2). 
Note  that  flr(A)  =  Lc-,  where  C  =  { f(C)\C  is  a  to-minimal  cycle}. 

Problem  3.3.3  [Extended  Parametric  Minimum  Cycle] 

Given  is  G  =  (V,  E,w, /)  as  above.  If  <j(A)  >  0  for  some  A,  then  output  any  such 
A;  otherwise,  find  A  €  relint  A  and  a  collection  C  =  {C\,...,CT}  (r  <  2<f)  of  to- 
minimal  cycles,  where  each  consists  of  at  most  n  edges,  such  that  the  lower  envelope 
L{f(c)\c$c}  cycle  values  is  a  minimal  weak  approximation  of  g,  and  find  a 

balancing  combination  of  the  cycle  values  /(Ci), . . . ,  f{Cr)  relative  to  Ef . 
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The  EPMC  algorithm  presented  below  performs  “oracle”  calls.  The  oracle  algo¬ 
rithm  recursively  solves  instances  of  the  EPMC  problem  on  G  with  sets  of  vector 
weights  w'  :  E  — *  R1*1  and  f'\E—>  Rd~l .  The  dimension  of  the  f  weights,  which 
corresponds  to  the  number  of  parameters,  decreases  in  the  recursive  calls.  In  order 
to  solve  an  instance  of  the  parametric  minimum  cycle  problem  (Problem  3.1.7),  we 
start  with  an  instance  of  EPMC  where  /  gives  the  vector  edge  weights  and  w  is  a 
set  of  null  vectors.  At  the  base  of  the  recursion  the  weights  /  are  null  vectors,  and 
the  problem  is  reduced  to  the  non-parametric  problem  of  computing  minimum  cycle 
relative  to  the  lexicographic  order  on  d-tuples. 

In  Subsection  3.3.3  we  propose  Algorithm  3.3.14  for  Problem  3.3.3.  The  algorithm 
executes  calls  to  the  oracle  problem  (Problem  3.1.8)  relative  to  g.  An  algorithm  for 
the  oracle  problem  is  given  in  Subsection  3.3.1.  A  call  to  the  oracle  is  a  costly 
operation.  Therefore,  one  wishes  to  solve  many  hyperplane  queries  with  a  small 
number  of  oracle  calls.  In  Subsection  3.3.2  we  discuss  the  multi-dimensional  search 
technique  (introduced  in  [47]).  By  applying  it,  we  are  able  to  reduces  the  number  of 
oracle  calls  performed  to  a  polylog  in  the  number  of  hyperplane  queries. 


3.3.1  Hyperplane  queries 

For  a  given  a  hyperplane  H  of  we  wish  to  solve  Problem  3.1.8  for  g  relative  to 
H.  If  H  fl  relint  A  ^  0,  we  solve  Problem  3.3.3,  that  is,  we  find  A  €  relint  A,  the 
collection  C,  and  a  balancing  combination  of  C. 

Problem  3.3.4  Given  is  G  =  (V,  E,w,f)  and  a  hyperplane  H  =  {A  £  R$  \  aX  =  a} 
in  the  A-space.  Solve  Problem  3.1.8  for  g  relative  to  H. 

Theorem  3.3.5  Problem  S.S.4  can  be  solved  by  an  algorithm  which  performs  three 
calls  to  instances  of  Problem  S.S.S  where  f  is  (d  —  l)-dimensional.  The  time  com¬ 
plexity  of  the  additional  computation  is  dominated  by  these  calls:  It  can  be  done 
sequentially  in  C\E\  +  D  time  for  some  constants  C  =  0(d)  and  D  =  0(<f3).  In 
parallel,  it  can  be  done  in  constant  B  =  O(cP)  time  using  0(m  -1-  n3)  processors. 
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Proof:  Consider  Problem  3.3.3  subject  to  a7 A  =  a  and  A  £  Rf.  This  is  in  fact 
a  (d  —  l)-dimensional  version  of  the  original  problem  restricted  to  the  hvperplane 
H  —  {A  €  |  a7 A  =  a}.  If  g  is  unbounded  on  H,  then  this  fact  is  detected; 

otherwise,  suppose  A^  is  in  the  relative  interior  of  the  set  of  maximizers  of  17(A) 
subject  to  A  £  H,  and  we  get  the  collection  C ^  and  a  balancing  combination  of 
C relative  to  H .  Suppose  t ^  =  ^(A^).  We  wish  to  recognize  whether  A^  is 
also  a  relative  interior  point  of  the  set  of  global  maxima  (i.e.,  relative  to  i?f).  If 
not,  then  we  wish  to  decide  whether  for  all  A"  €  R*  such  that  ^(A")  >  g(A^), 
necessarily  a7 A"  >  a,  or  whether  for  all  of  them  aT A'  <  a.  These  are  all  the 
possible  cases.  Consider  G‘  =  (V,  E,w' ,  f),  where  w'  =  (tc,  fT  A^).  Note  that 
all  the  minimum  cycles  of  G'  correspond  to  minimum  cycles  with  value  at  A 
of  G.  We  solve  Problem  3.3.3  twice  on  G',  where  in  one  of  the  problems  we  also 
include  the  constraint  a7 A  =  a  —  1,  and  in  the  other  we  include  the  constraint 
aT A  =  a  +  1.  Both  problems  can  be  solved  as  (d  —  1)- dimensional  problems  since 
one  of  the  A;’s  can  be  eliminated.  Denote  the  optimal  values  of  these  problems 
by  fW,  the  corresponding  collections  of  cycles  by  C6,  and  let  af  be  coefficients 
of  a  balancing  combination  of  Cf  (6  €  {—1,1}).  Only  one  of  the  optimal  values 
can  be  greater  than  t^°\  If  this  is  the  case,  or  if  one  of  equals  and  the 

other  is  smaller,  then  the  side  of  the  hyperplane  that  contains  relint  A  is  determined. 
Otherwise,  if  either  both  are  less  than,  or  both  are  equal  to  t^°\  then  is  the  global 
optimal  value.  In  the  latter  case  A^  €  relint  A.  It  follows  from  Proposition  3.1.9 
that  the  pieces  of  g  which  are  active  in  a  minimal  weak  approximation  have  the 
value  t at  X^°\  Thus,  a  minimal  weak  approximation  of  the  function  g'  (see 
Definition  3.3.1)  which  corresponds  to  G'  is  a  minimal  weak  approximation  of  the 
“original”  g  which  corresponds  to  the  graph  G.  The  conditions  of  Proposition  3.1.18 
and  Corollary  3.1.17  hold  for  the  function  g'.  Hence,  in  0(<P)  operations  we  can 
construct  a  minimal  weak  approximation  of  <7,  and  find  a  corresponding  balancing 
combination.  The  base  of  the  recursion  is  the  one-dimensional  problem  (with  no 
parameters  at  all)  where  C  consists  of  a  single  cycle  with  minimal  value,  and  the 
balancing  combination  is  the  value  of  this  cycle.  1 


3.3.  ALGORITHM  FOR  PARAMETRIC  MINIMUM  CYCLE 


59 


3.3.2  Employing  multi-dimensional  search 

The  multi-dimensional  search  problem  was  defined  and  used  in  [47]  for  solving  linear 
programming  problems  in  fixed  dimension. 

Problem  3.3.6  [multi-dimensional  search] 

Suppose  there  exists  an  unknown  convex  set  A'  C  Rd,  and  an  oracle  is  available 
such  that  for  any  query  hyperplane  H  in  Rd,  the  oracle  tells  whether  A'  D  H  =  0; 
if  so,  then  the  oracle  tells  which  of  the  open  halfspaces  determined  by  H  contains 
A'.  For  m  given  query  hyperplanes,  determine  the  location  of  X  relative  to  each  of 
the  hyperplanes,  or  find  any  hyperplane  (not  necessarily  one  of  the  given  ones)  which 
intersects  X. 

The  following  theorem  was  proven  in  [47]: 

Theorem  3.3.7  The  solution  of  Problem  S.S.6  relative  to  some  m/2  of  the  given  hy¬ 
perplanes  can  be  found  by  using  7  =  7 (d)  oracle  calls.  The  additional  computation  can 
be  done  sequentially  in  0{^{d)m)  time,  and  on  m  parallel  processors  in  0(~f(d)  log  m) 
time.  From  [5,  19]  7(d)  =  0((b/9)dZ*). 

Corollary  3.3.8  Problem  S.S.6  can  be  solved  by  using  0(7(d)log  m)  oracle  calls  and 
0(-y(d)m)  additional  time  (see  [47]). 

Remark  3.3.9  The  procedure  described  in  [47]  can  be  parallelized  so  that  the  ad¬ 
ditional  computation  (besides  the  0(i(d)  log  m)  oracle  calls)  is  done  by  m  parallel 
processors  in  0(~t(d)  log2  m)  time. 

Definition  3.3.10  We  define  a  partial  order  on  R?  \  {0}  as  follows.  For  any  pair  of 
distinct  vectors  ai}a2  €  Rd,  denote 

H  =  H{aua  2)  =  {A  €  J#  :  of  A  =  of  A}  . 

If  ^  is  unbounded  on /f(ai,Oj)  or  if  if(ai,a2)nrelint  A  ^  0,  then  we  write  Oi  <>a  a2. 
Otherwise,  g  can  be  unbounded  on  at  most  one  of  the  open  subspaces  determined  by 
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H,  and  also  relint  A  can  intersect  at  most  one  of  these  open  halfspaces.  We  denote 
ai  < a  0,2  (respectively,  a,i  >A  <12)  if  there  exists  a  A  €  relint  A  such  that  a[A<a[A 
(respectively,  af  A  >  a\  A),  in  which  case  the  same  holds  for  all  these  A’s,  or  if  g  is 
unbounded  on  the  halfspace  determined  by  the  inequality  afA  <  afA  (respectively, 
af  A  <  afA).  We  also  use  the  notation  <p  for  a  similar  partial  order  relative  to  any 
set  P. 

Problem  3.3.11  Given  are  finite  sets  Ai,...,Ar  of  nonzero  vectors,  where  A,  = 
{aj,...,a*.}  (at  £  Rd )  and  s  =  We  wish  either  to  find  a  minimal  element, 

with  respect  to  the  partial  order  <A,  in  each  of  the  sets  Ai,  or  (if  we  encounter  two 
incomparable  elements)  to  reduce  the  problem  to  a  lower  dimension.  More  specifically, 
we  need  to  find  either  one  of  the  following: 

i.  A  collection  of  closed  halfspaces  whose  intersection  P  contains  relint  A,  and 
indices  1  <  TOj  <  a*  (»  —  1, . . .  ,r)  such  that  for  every  1  <  i  <  r  and  every 
1  <  j  <  Si,  j  /  mi,  we  have  a^  <A  a)  and  aj*.  <P  a). 

ii.  A  hyperplane  H  such  that  either  g  is  unbounded  on  H  or  H  fl  relint  A  ^  0. 

Proposition  3.3.12  Problem  8.8.11  can  be  solved  using  0(7(d  —  l)log  s)  oracle  calls 
plus  either 

i.  0(7(d  —  l)log2s)  parallel  time  on  0(a)  processors,  or 
ii.  0{i{d  —  l)aloga)  sequential  time. 

(Where  7 (d  —  1)  is  as  in  Theorem  8.8.7.) 

Proof:  The  underlying  algorithm  is  an  extension  of  the  multi- dimensional  search 
procedure  for  Problem  3.3.6  mentioned  in  Theorem  3.3.7.  Here  the  set  X  is  ei¬ 
ther  relint  A  or  (if  the  latter  is  empty)  a  domain  where  the  function  g  is  un¬ 
bounded.  Thus,  the  case  where  X  intersects  the  query  hyperplane  corresponds 
to  the  case  where  either  g  is  unbounded  on  the  query  hyperplane,  or  the  latter 
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intersects  relint  A;  the  case  where  A'  is  contained  in  one  of  the  open  halfspaces 
corresponds  to  the  case  where  either  relint  A  is  contained  in  the  halfspace,  or  g  is 
unbounded  on  the  halfspace  (but  bounded  on  the  hyperplane).  Also,  note  that  the 
flat  R f  on  which  the  multi- dimensional  search  is  done,  is  of  dimension  d  —  1. 

We  first  explain  how  to  recognize  for  a  given  pair  of  distinct  vectors  ,  a  j  whether 

<a  P2,  &2  <a  Oi  or  ai  <>a  a.1.  Consider  the  hj’perplane  H  =  H(a.i,aj)  C  R f 
(see  Definition  3.3.10).  Suppose  H  is  the  query  hyperplane  presented  to  an  oracle 
which  recognizes  the  location  of  the  set  rel  int  A  relative  to  H  (in  the  sense  of  Prob¬ 
lem  3.3.4).  In  particular,  if  H  fl  relint  A  ^  0,  then  the  oracle  discovers  this  fact 
and  returns  a  A  €  H  fl  relint  A.  Similarly,  if  g  is  unbounded  on  H,  then  the  oracle 
reports  this  fact  and  provides  a  A  such  that  g( A)  >  0.  In  the  remaining  cases  the 
oracle  reports  either  <a  a2  or  a2  <a  ai- 

The  multi- dimensional  search  algorithm  computes,  adaptively,  0(~i{d—  l)log  s)  hy¬ 
perplane  queries.  If  any  of  these  hyperplanes  either  intersects  relint  A  or  has  g 
unbounded  on  it,  then  this  fact  is  reported,  and  the  present  problem  is  considered 
solved.  Otherwise,  each  of  the  hyperplanes  determines  a  closed  halfspace  such  that 
the  intersection  P  of  all  these  halfspaces  has  the  following  property:  either  g  is  un¬ 
bounded  on  the  interior  of  P,  or  relint  A  is  nonempty  and  contained  in  the  interior 
of  P.  Moreover,  vectors  ami, . . . ,  aTnr  are  found,  such  that  for  every  i,  a'm.  <p  a’, 
lor  all  1  <  j  <  j  ^  TO*. 

We  implement  the  algorithm  as  follows.  View  the  dimension  d  as  fixed.  It  was 
shown  in  [5]  and  [19]  that  by  using  a  constant  number  of  oracle  calls  (which,  how¬ 
ever,  grows  exponentially  with  the  dimension)  one  can  locate  X  relative  to  at  least 
half  of  the  hyperplanes.  A  similar  scheme  can  be  applied  here.  We  apply  0(log  s) 
phases.  First,  for  each  i  (1  <  z  <  r)  we  match  the  members  of  A,-  into  s;/2  ar¬ 
bitrary  pairs.  This  is  done  with  at  most  s/2  processors.  We  then  calculate  the 
corresponding  (at  most  s/2)  hyperplanes  if  (a;,  a,)  (see  Definition  3.3.10).  In  a 
constant  number  7 (d  —  1)  of  oracle  calls  and  0(lpg  s)  time  we  can  locate  relint  A 
relative  to  half  of  these  s/2  hyperplanes;  unless  one  of  these  hyperplanes  turns  out 
to  be  a  valid  output  (in  the  sense  of  (ii)  in  Problem  3.3.11).  We  now  drop  one 
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vector  from  every  pair  for  which  the  location  relative  to  relint  A  has  been  found. 
The  same  is  repeated  with  the  remaining  3$/4  vectors,  and  so  on.  Altogether,  we 
run  in  0(log  s)  phases,  each  of  which  takes  0(i{d  —  l)log  s)  time  on  0(s )  proces¬ 
sors,  and  0(,y(d  —  1))  oracle  calls.  The  sequential  time  bound  is  0{i(d  —  l)slogs) 
plus  0(7(d  —  1 )  log  s )  oracle  calls.  In  parallel,  using  O(s)  processors,  the  time  is 
0(~i(d  —  l)log2.s)  plus  0(y(d  —  l)logs)  oracle  calls.  | 

3.3.3  Algorithm  for  extended  parametric  minimum  cycle 

The  algorithm  described  below  solves  Problem  3.3.3.  It  finds  a  vector  A  €  relint  A, 
unless  g(A)  >  0  for  some  A,  in  which  case  the  algorithm  outputs  such  a  A.  It  also 
returns  a  collection  C  of  tr-minimal  cycles  such  that  the  lower  envelope  C£Cy 

of  the  linear  functions  defined  by  /(C)  is  a  minimal  weak  approximation  of  g.  The 
number  of  cycles  in  C  is  at  most  2d. 

Definition  3.3.13  Consider  a  scalar  minimum  cycle  algorithm,  where  the  only  prim¬ 
itive  operations  on  expressions  that  depend  on  the  edge  weights  are  additions,  multi¬ 
plications  by  scalars,  and  comparisons.  We  define  the  corresponding  lifted  algorithm 
for  input  graphs  of  the  form  G  =  (V,  E,  f,w).  The  weight  of  an  edge  e  €  E  on 
these  input  graphs  is  an  (£  +  l)-tuple  (u>i(e), . . .  /(e)),  where  /(e)  €  R?  is 

viewed  as  a  (d  —  l)-dimensional  linear  function  and  to(e)  =  (u;i(e), . . .  ,u/*(e)).  The 
lifted  algorithm  is  an  extension  of  the  scalar  minimum  cyde  algorithm  that  operates 
on  such  (i  +  1)- tuples  instead  of  scalars.  The  extension  of  the  operations  of  addi¬ 
tion  and  multiplication  by  a  scalar  is  straightforward,  namely,  given  tuples  (tOj,  /j), 
(w2,/2)  their  sum  is  (t»i  +  w>2,/i  +  f%),  and  the  multiplication  by  a  scalar  a  is 
(cn»i,a/j).  Comparisons  are  made  with  respect  to  a  lexicographic  partial  order  on 
the  (l  +  l)-tuples.  It  is  only  a  partial  order  since  in  the  (£  +  l)st  coordinate  we  have 
the  partial  order  <a  (see  Definition  3.3.10).  To  compare  two  (t  +  l)-tuples  (tOi,/,) 
and  (t02,/2),  we  first  compare  lexicographically  the  first  £  scalar  coordinates.  If  the 
comparison  is  not  resolved  there,  we  need  to  compare  the  linear  functions  fl  and 
f2.  For  this  purpose,  the  lifted  algorithm  computes  the  hyperplane  and 
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solves  Problem  3.3.4  (hyperplane  query)  relative  to  H.  A  set  of  hvperplane  queries 
is  resolved  by  performing  “oracle”  calls  (see  Subsection  3.3.2).  The  lifted  algorithm 
maintains  a  set  H  of  closed  halfspaces  which  initially  is  empty.  The  hvperplane  query 
decides  whether  or  not  the  vectors  are  comparable.  If  they  are,  it  decides  whether 
fi  <a  / 2-  If  / 1  <>a  f  2)  then  the  lifted  algorithm  halts  since  an  oracle  call  resulted 
in  a  solution  to  Problem  3.3.3.  Otherwise,  the  resolved  hyperplane  query  tells  us 
which  of  the  halfspaces  defined  by  H(fl,f2)  (see  Definition  3.3.10)  contains  the  set 
relint  A,  so  this  hyperplane  is  added  to  H. 


Algorithm  3.3.14  [Extended  parametric  minimum  cycle] 


Step  1.  Run  the  lifted  minimum  cycle  algorithm,  collecting  into  TL  all  the  halfs¬ 
paces  resulting  from  oracle  calls  where  comparisons  are  resolved.  Either  some 
oracle  call  resulted  in  a  global  solution,  or  otherwise,  the  algorithm  terminates 
normally.  Denote  by  Cm  the  minimum  cycle  found. 

Step  2.  Denote  by  P  the  intersection  of  the  halfspaces  in  H. 

i.  Compute  A'  €  relint  P.  This  amounts  to  a  linear  programming  prob- 
lemwith  d  —  1  variables  and  \H\  constraints,  and  hence  it  can  be  solved 
in  0([H\)  sequential  time  [47],  Note  that  the  size  of  Ti  is  bounded  by  the 
number  of  oracle  calls. 

ii.  If  the  function  L^c^  is  not  constant  on  Rf ,  that  is,  not  all  the  coef¬ 
ficients  Ji{Cm),  /:(€**),  •  •  • ,  equal  zero,  then  g  is  unbounded. 

Otherwise, 

iii.  consider  p(A*)  =  /^(Cm)- 

•  If  $r(A‘)  >  0,  then  output  A*  and  stop.  Otherwise, 

•  the  function  L^c^  is  a  weak  approximation  of  g,  and  P  —  A.  Hence, 
A*  €  relint  A.  Output  A*  and  C  =  {Cm}- 


64 


CHAPTER  3.  PARAMETRIC  MINIMUM  CYCLE 


3.3.4  Correctness 


If  an  oracle  call  results  in  a  solution  in  Step  1  of  Algorithm  3.3.14,  then  correctness 
follows  by  induction  on  the  dimension  (see  also  the  discussion  under  Hyperplane 
Queries).  We  now  assume  that  no  oracle  call  resulted  in  a  solution  in  Step  1.  In  this 
case,  a  collection  7i  of  closed  halfspaces  is  obtained.  Recall  that  if  an  oracle  call  on 
a  hyperplane  H  did  not  result  in  a  solution  then  the  returned  halfspace  h  has  the 
following  properties:  (i)  if  the  function  g  is  bounded  then  A  C  h  but  A  <f_  H,  (ii)  if 
the  function  g  is  unbounded,  then  it  must  be  bounded  on  the  hyperplane  H,  and 
unbounded  on  the  halfspace  h.  Let  P  be  the  polyhedron  P  =  (\eW  It  follows  that 
if  g  is  bounded  then  PDA,  and  if  g  is  unbounded  then  it  must  be  bounded  outside 
and  on  the  boundary  of  P.  Note  that  P  must  be  full  dimensional  (dim  P  =  d—  1),  for 
if  not  then  it  must  be  contained  in  one  of  the  query  hyperplanes,  which  contradicts 
the  previous  statement. 


Observe  that  for  all  pairs  ai,a2  of  vectors  compared  by  the  lifted  minimum  cycle 
algorithm,  one  of  the  following  must  hold:  either  a i  <a  a2  and  aa  <p  a2,  or  a2  <a 
ai  and  a2  <p  ai.  The  latter  is  obvious  when  we  perform  an  oracle  call  for  each 
hyperplane  query,  and  it  is  easy  to  see  that  it  still  holds  when  we  employ  the  multi¬ 
dimensional  search  technique  (see  Problem  3.3.11  and  Proposition  3.3.12)  and  solve 
these  hyperplane  queries  by  a  smaller  number  of  oracle  calls.  Thus,  the  vector  value 
/(CW)  of  the  minimum  cycle  found  by  the  algorithm  must  be  such  that  f[Cu)  <A 
f(C)  and  hence  /(Cat)  <p  f{C)  for  any  to-minimal  cycle  C.  It  follows  that  p(A)  = 
/(Cm)TA  for  all  A  €  P.  Thus,  g  is  unbounded  if  and  only  if  f(Cu)  is  not  a  constant, 
and  the  correctness  of  step  ii  follows.  To  show  the  correctness  of  step  iii  assume  that 
/( Cm )  is  a  constant,  and  thus  g  =  /«i(A)  for  all  A  €  P.  Since  P  D  A  we  have 
P  =  A.  It  follows  that  A"  €  relint  A,  aff  A  =  Pf,  and  “  a  minimal  weak 

approximation  of  g. 
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3.3.5  Complexity 

The  complexity  of  the  algorithm  is  related  to  the  number  of  oracle  calls.  We  would  like 
to  resolve  many  hyperplane  queries  by  performing  only  a  polylogarithmic  number  of 
oracle  calls.  Thus,  it  is  advantageous  to  group  together  many  comparisons  that  could 
be  done  “in  parallel”  and  employ  the  multi-dimensional  search  techniques  discussed 
in  Proposition  3.3.12. 

Theorem  3.3.15  Algorithm  3.3.14  can  be  implemented  with  complexity  as  follows 
( where  m  =  \E\  and  n  =  \V\). 

i.  0(log2dn  4-  logdm)  parallel  time  on  0{n3  -f  m)  processors. 

ii.  0(m(\og2d  n  -f-logdm))  sequential  time,  when  m  =  D(n3logn). 

Hi.  0(\og2d  n(n3  -I-  m ))  sequential  time,  when  m  =  O(n3logn)  and  m  =  12(n2). 
iv.  0(n3  log2^-2)  n  -t-  nmlog2^-1)  n)  sequential  time,  when  m  =  0(n2). 

Proof:  The  problem  of  all-pairs  shortest  path  can  be  solved  in  0(log2  n)  time  using 
n3  processors  by  the  Floyd-Warshall  algorithm  [13].  The  algorithm  for  this  problem 
runs  in  O(log  n)  phases.  During  the  first  phase,  the  minimal  among  all  the  parallel 
edges  is  determined  for  each  pair  of  vertices  which  are  linked  with  at  least  one  edge. 
In  general,  the  minimal  value  in  a  set  is  computed  for  0(n2)  sets,  each  with  0(n) 
elements.  More  precisely,  during  phase  l,  for  each  ordered  pair  (i,j)  of  vertices  we 
find  d\j,  the  length  of  shortest  path  from  t  to  j  consisting  of  at  most  2l  edges.  We 
use  the  relation  d*fl  =  min { d\j ,  min* { d^  -I-  dj^}}.  To  find  a  minimum  cycle,  we  run 
one  more  phase,  where  we  compute  a  minimum  of  the  diagonal  elements  in  the  dis¬ 
tance  matrix.  The  complexity  of  this  last  phase  is  dominated  by  the  other  phases. 
Each  phase  can  be  implemented  in  one  application  of  Problem  3.3.11,  with  s  =  m 
for  the  first  phase,  and  s  =  n3  for  the  remaining  O(logn)  phases.  The  complexity 
is  analyzed  in  Proposition  3.3.12. 

Denote  by  Tj  and  Rj,  respectively,  the  sequential  time  complexity  and  the  paral¬ 
lel  time  complexity  with  n3  +  m  processors,  of  the  d-dimensional  problem.  Recall 
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from  Theorem  3.3.5  and  Remark  3.3.9  that  the  time  complexity  of  one  oracle  call 
is  3Tk_!  =  0{Td-i)  on  a  single  processor  and  3J2j_i  =  0(Rd~\)  on  0(n3  -r  m) 
processors.  When  d  =  1,  an  oracle  call  can  be  implemented  simply  by  a  scalar 
minimum  cycle  algorithm.  We  derive  recursion  formulas  for  fZ,-  and  Tt.  The  or¬ 
acle  calls  are  executed  sequentiall}'.  First  we  derive  an  expression  for  the  paral¬ 
lel  complexity  when  d  —  1.  Note  that  the  problem  can  be  solved  by  employing 
7i3  processors  for  0(log27i)  time  plus  m  processors  for  0(log  m)  time.  Thus,  on 
0(n3  +  min{m,mlogm/log2n})  processors,  we  have  R i  =  0(log2  n  -f  log  m).  It 
follows  that 

Rd  =  0(log3  n  -f  log2  m  +  (log2  n  +  log  m)Rd- 1 ), 

which  proves  i. 

The  sequential  complexity  for  d  =  1  is  Ti  =  0(min{nm,m-i-n3}).  Parts  ii-iv  of  the 
proposition  follow  from  the  recursion: 

Td  =  0(nz log  n  -+  m  +  (log2  n  +  log m)Td-i). 

The  above  analysis  applies  only  to  the  complexity  of  Step  1  of  the  algorithm.  In 
Step  2  we  compute  A  6  relint  P.  This  is  done  by  solving  a  linear  program  with 
0(71)  equations  and  0(d)  variables.  The  size  of  Ti  is  at  most  the  number  of  oracle 
calls  performed,  that  is,  \TL\  =  O(log2  n-flog  m).  Therefore  the  complexity  of  Step  2 
is  dominated  by  that  of  Step  1.  | 

Remark  3.3.16  There  are  hidden  constant  factors  in  the  complexities  stated  in  The¬ 
orem  3.3.15,  which  depend  on  the  dimension  d.  First,  there  is  an  extra  ctor  of  d 
on  the  serial  time  complexity  and  the  number  of  parallel  processors,  since  most  “op¬ 
erations”  are  done  on  vectors  in  R1  and  take  d  time  units.  Second,  there  is  an 
0(2d~f(d—  1))  factor  on  the  time  complexities.  The  factor  7 (d—  1)  =  0((5/9)rf3^rf— ) 
is  due  to  the  multi- dimensional  search  (see  Subsection  3.3.2),  and  the  factor  of  0(3**) 
is  due  to  the  fact  that  an  oracle  call  involves  three  calls  to  a  problem  of  lower  dimen¬ 
sion.  It  follows  that  the  constant  factor  in  the  serial  time  complexity  is  0(d3di(d—  1)). 
In  the  parallel  case  there  is  a  factor  of  0(3di(d  —  1))  for  the  time  complexity,  and  a 
factor  of  0(d)  on  the  number  of  processors. 


Chapter  4 

Convex  optimization  in  fixed 
dimension 


In  Chapter  3  we  introduced  a  technique  which  enabled  us  to  solve  the  parametric 
minimum  cycle  problem  with  a  fixed  number  of  parameters  in  strongly  polynomial 
time.  In  the  current  chapter  we  present  this  technique  as  a  general  topi.  In  order  to 
allow  for  an  independent  reading  of  this  chapter,  we  repeat  some  of  the  definitions 
and  propositions  given  in  Chapter  3.  Some  proofs  are  not  repeated,  however,  and 
instead  we  supply  the  interested  reader  with  appropriate  pointers. 

Suppose  Q  C  R?  is  a  convex  set  given  as  an  intersection  of  k  halfspaces,  and  let 
g  :  Q  — ►  R  be  a  concave  function  that  is  computable  by  a  piecewise  affine  algorithm 
(i.e.,  roughly,  an  algorithm  that  performs  only  multiplications  by  scalars,  additions, 
and  comparisons  of  intermediate  values  which  depend  on  the  input).  Assume  that 
such  an  algorithm  A  is  given  and  the  maximal  number  of  operations  required  by  A  on 
any  input  (i.e.,  point  in  Q)  is  T.  We  show  that  under  these  assumptions,  for  any  fixed 
d,  the  function  g  can  be  maximized  in  a  number  of  operations  polynomial  in  k  and 
T.  We  also  present  a  general  framework  for  parametric  extensions  of  problems  where 
this  technique  can  be  used  to  obtain  strongly  polynomial  algorithms.  Norton,  Plotkin, 
and  Tardos  [50]  applied  a  similar  scheme  and  presented  additional  applications. 
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4.1  Introduction 


A  convex  optimization  problem  is  a  problem  of  minimizing  a  convex  function  g  over 
a  convex  set  S  C  Rd ■  Equivalently,  we  can  consider  maximizing  a  concave  function. 

We  consider  the  problem  of  maximizing  a  concave  function,  where  the  dimension 
of  the  space  is  fixed.  We  also  assume  that  the  function  g  is  given  by  a  piecewise  affine 
algorithm  (see  Definition  4.2.2)  which  evaluates  it  at  any  point. 

The  results  of  this  chapter  can  be  extended  easily  to  the  case  where  the  range  of  g 
is  Rl  for  any  l  >  1.  We  then  define  the  notions  of  maximum  and  concavity  of  g  with 
respect  to  the  lexicographic  order  as  follows.  We  say  that  a  function  g  :  Q  C  Rd  —*  R.1 
is  concave  with  respect  to  the  lexicographic  order  <iex  if  for  every  a  E  [0, 1]  and 

*,y  €  Q, 

ag(x)  4-  (1  -  ot)g(y)  <Ux  g{ax  +  (1  -  a)y)  . 

Applications  where  the  range  of  g  is  R2  were  given  in  [10]. 

In  Section  4.2  we  define  the  problem.  In  Section  4.3  we  introduce  the  subpj-oblem 
of  hyperplane  queries,  which  is  essential  for  the  design  of  our  algorithm.  In  Section  4.4 
we  discuss  the  mui.  -dimensional  search  technique  which  we  utilize  for  improving  our 
time  bounds.  In  Section  4.5  we  introduce  the  optimization  algorithm.  In  Sections  4.6 
and  4.7  we  prove  the  correctness  and  analyze  the  time  complexity  of  the  algorithm.  In 
Section  4.8  we  discuss  applying  the  technique  introduced  in  Chapters  3  and  4  to  obtain 
strongly  polynomial  time  algorithms  for  parametric  extensions  of  other  problems. 


4.2  Preliminaries 


Let  R%  denote  the  set  of  vectors  A  =  (A1} . . . ,  A,*)r  €  R?  such  that  Aj  =  1.  For 
A  €  denote  by  A  E  R$  the  vector  A  =  (A,  1).  For  a  halfspace  F,  denote  by  dF 
the  boundary  of  F,  i.e.,  dF  is  a  hyperplane. 
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Definition  4.2.1  For  a  finite  set  C  C  Rd+1 ,  denote  by  Lc  '■  —*  R  the  minimum 

envelope  of  the  linear  functions  that  correspond  to  the  vectors  in  C,  i.e. , 

Lc{ A)  =  min  cT A  . 

Denote  by  Lc  :  Rd  — * ►  R  the  function  given  by 

Lc{ A)  =  Lc( A)  . 

The  vectors  in  C  C  i?d+1  are  called  the  pieces  of  g.  For  a  piece  c  €  C  and  a  vector 
/3  €  Rd  such  that  cT/3  =  g((3),  we  say  that  c  is  active  at  /3. 

Definition  4.2.2  For  a  function  g  :  Q  — >  R,  where  Q  C 

i.  Der^  by  Ag  (or  A,  for  brevity)  the  set  of  maximizers  of  5(A).  The  set  A  may 
be  empty. 

ii.  An  r'gorithm  that  computes  the  function  g  (i.e.,  for  A  6  Q  returns  the  value 
5(A)  and  otherwise  stops  or  returns  an  arbitrary  value)  is  called  piecewise  affine, 
if  the  operations  it  performs  on  intermediate  values  that  depend  on  the  input 
vector  are  restricted  to  additions,  multiplications  by  constants,  comparisons, 
and  copies. 

iii.  For  a  piecewise  affine  algorithm  4,  Uv.note  by  T(A)  and  C(A)  the  maximum 
number  of  operations  and  the  maximum  number  of  comparisons,  respectively, 
performed  by  A.  We  assume  that  this  numbers  are  finite. 

iv.  If  g  =  Lc  for  some  C  C  Rd+1,  we  say  that  g'  =  Lc  is  a  weak  approximation  of 
g ,  if  the  set  of  pieces  of  g'  is  a  subset  of  the  set  of  pieces  of  g  (C1  C  C),  and  the 
affine  hulls  aff  Ag  and  aff  Ag>  are  equal.  The  function  g'  =  Lc  is  a  minimal  "weak 
approximation  of  g,  if  there  is  no  C"  C  C'  such  that  Lc  is  a  weak  approximation 
of  g. 

Remark  4.2.3  Suppose  that  A  is  a  piecewise  affine  algorithm.  Consider  the  compu¬ 
tation  tree  (i.e.,  the  tree  consisting  of  all  possible  computation  paths)  of  A.  Observe 
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that  all  the  intermediate  values  along  a  computation  path,  including  the  final  out¬ 
put,  can  be  expressed  as  linear  functions  (i.e..  are  of  the  form  aT A)  of  the  input 
vector.  These  linear  functions  can  be  easily  computed  and  maintained  during  a  sin¬ 
gle  execution  of  the  algorithm.  These  linear  functions  map  the  input  vectors  whose 
computation  path  coincides  so  far  with  that  same  path  to  the  corresponding  value. 
Moreover,  the  linear  function  which  corresponds  to  the  fined  output  at  a  single  exe¬ 
cution  is  a  piece  which  is  active  at  the  input  vector. 

Remark  4.2.4  Suppose  g  :  Q  C  Rd  — * -  R  is  concave  and  computable  by  a  piecewise 
affine  algorithm  A.  It  is  easy  to  see  that  there  exists  a  finite  set  C  C  Rd 4-1  such  that 
g  coincides  with  Lc- 

Definition  4.2.5  Suppose  Q  =  fl  •  •  •  fl  Fk  is  the  intersection  of  k  closed  halfspaces 
and  g  :  Q  — >  R,  g  =  Lc,  is  concave  and  computable  by  a  piecewise  affine  algorithm. 

i.  For  (3  €  Rd,  denote 

Denote  by  g0  :  Q0  C  Rd 
which  are  active  at  (3, 

g0{\)  =  nun{cTA  :  cT ( 3  =  g{(3)}  . 

Note  that  g0  =  Lc  where  C'  =  {c  6  C  |  cTj3  =  g(f. 3)}.  See  Figure  4.1  for  an 
example.  Later,  we  describe  an  algorithm  for  evaluating  g0. 

ii.  For  a  given  sequence  of  vectors  /3:, . . .  ,(3t  €  R*1,  denote  by  g0l...0t  the  function 
((•  •  •  (9fi i  )ft)  •  •  -W  Note  that  g0i...0t  depends  on  the  order  of  the  /?i’s. 

iii.  Suppose  an  ^-dimensional  flat  S  C  Rd,  is  represented  as  a  set  of  solutions  of  a 
linear  system  of  equations.  There  exists  an  affine  mapping  M  from  Rl  onto  S, 
which  can  be  computed  in  0(<P)  time.  Denote  Qs  =  {A  €  Rl  |  M( A)  €  Q}, 
and  define  gs  :  Qs  — ^ >  R  by  gs  =  g  o  M. 


Q0  =  n {Fi  :  0  €  dFi}  . 

— »  R  the  function  whose  pieces  are  all  the  pieces  of  g 
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Figure  4.1:  Examples  of  restrictions  of  g 

Proposition  4.2.6  Let  A  be  a  piecewise  affine  algorithm  for  evaluating  g  :  Q  —*  R, 
vihere  Q  C  Rd  is  given  as  the  intersection  of  k  halfspaces.  By  modifying  A  we  can 
obtain  the  following  piecewise  affine  algorithms: 

i.  For  any  given  vector  (3  €  Q,  we  obtain  an  algorithm  A0  for  evaluating  g0  so 
that  T(A0)  =  T{A)  +  dC(A)  and  C(A0)  =  C{A). 

ii.  For  any  l-dimensional  flat  S  C  Rd,  represented  as  the  set  of  solutions  of  a 
system  of  linear  equations,  we  obtain  an  algorithm  As  for  evaluating  gs  so  that 
T(AS )  =  T{A)  +  0{ld)  and  C(^s)  =  C(A). 

Proof:  Part  ii  is  straightforward,  since  we  can  choose  the  algorithm  As  to  be  a 
composition  of  the  appropriate  affine  mapping  and  A.  We  discuss  the  construction 
of  the  algorithm  A0  for  part  i.  Consider  an  input  vector  A  €  Q0.  Let  e  >  0  be 
such  that  for  all  e'  (0  <  e'  <  e),  (3  +  e'(A  —  /3)  €  Q,  and  the  set  of  pieces  of  g 
which  are  active  at  (3  +  e'(A  —  (3)  is  equal  to  the  set  of  pieces  which  are  active  at 
/3  -f  e(A  —  /3).  It  is  immediate  to  see  that  such  an  e  always  exists.  It  follows  from 
the  definition  that  g0( A)  is  the  value  of  A  at  the  linear  pieces  of  g  which  are  active 
at  (3  +  e(A  —  (3).  The  algorithm  A0,  when  executed  with  an  input  A,  follows  the 
computation  path  of  A  which  corresponds  to  the  input  /3  +  e(A  — /3).  The  algorithm 
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computes  the  linear  functions  associated  with  the  intermediate  values  of  this  path 
(see  Remark  4.2.3).  Recall  that  the  linear  function  which  corresponds  to  the  final 
value  is  a  piece  of  g  which  is  active  at  (3  —  e(A  —  (3).  Hence,  the  value  of  gg{A)  is 
obtained  by  substituting  A  in  this  linear  function.  In  order  to  follow  the  desired  run 
of  A,  the  algorithm  Ag  mimics  the  work  of  A  on  additions  and  multiplications  by 
scalars,  keeping  track  of  linear  functions  rather  than  just  numerical  values.  When 
the  run  of  A  reaches  a  comparison  (branching  point),  Ag  does  as  follows.  Without 
loss  of  generality  we  assume  that  the  branching  is  according  to  the  sign  of  the 
linear  function  aTx.  In  order  to  decide  what  to  do  at  a  branching  point,  Ag  has 
to  determine  the  sign  of  aTx  at  the  point  x  =  (3  e(A  —  /3).  Since  e  is  not  given 
explicitly,  the  decision  cannot  be  made  directly  by  substitution.  The  decision  is 
made  as  follows.  The  algorithm  first  computes  a  =  aT(3.  If  a  ^  0,  then  obviously 
for  any  vector  y ,  for  sufficiently  small  number  e  >  0  ar(/3  +  ey)  has  the  same  sign 
as  a.  In  particular  this  holds  for  y  =  A  —  (3  and  the  sign  is  detected.  Otherwise,  if 
a  =  0,  it  follows  that  aT(( 3  -f  e(A  —  /3))  =  eaT A.  Hence  orA  has  the  same  sign  as 
aT(/3  +  e(A  —  (3)).  It  remains  to  compute  the  sign  of  arA  and  branch  accordingly. 
It  is  easy  to  verify  that  Ag  evaluates  the  function  gg  for  any  vector  A  €  Qg,  and 
performs  the  stated  number  of  operations,  i 

Proposition  4.2.7  If  (3  €  Afl  then  gg  is  a  weak  approximation  of  g  (see  Proposi¬ 
tion  8  1.9  for  a  proof). 

The  goal  is  to  solve  the  following  problem: 

Problem  4.2.8  The  input  of  this  problem  consists  of  a  polyhedron  Q  =  FiC\-  ■  -HF*, 
given  as  the  intersection  of  k  closed  halfspaces  and  a  piecewise  affine  algorithm  A  for 
evaluating  a  concave  function  g  :  Q  — *  R.  Decide  whether  or  not  g  is  bounded.  If 
so,  then  find  a  A'  €  relint  A.  We  refer  to  the  following  as  the  “optional”  part  of  the 
problem:  If  g  is  bounded,  then  find  a  subset  C  of  the  set  of  pieces  of  g,  such  that  Lc 
is  a  minimal  weak  approximation  of  g,  and  \C\  <  2d. 

The  set  C  may  be  viewed  as  a  certificate  for  the  fact  that  the  maximum  of  the 
function  g  does  not  exceed  p(A’).  In  the  current  chapter  we  do  not  discuss  the  details 
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of  solving  the  optional  part  of  the  problem.  See  Chapter  3  for  an  existence  proof  and 
an  algorithm  which  finds  such  a  set. 

We  propose  an  algorithm  for  Problem  4.2.8.  In  any  fixed  dimension  d,  the  total 
number  of  operations  performed  by  this  algorithm  is  bounded  by  a  polynomial  in  T{A) 
and  k.  The  algorithm  is  based  on  solving  instances  of  a  subproblem,  which  we  call 
hyperplane  query:  For  a  given  hyperplane  Ho,  decide  on  which  side  of  H0  the  function 
g  is  either  unbounded  or  attains  its  maximum.  A  procedure  for  hyperplane  queries  is 
called  an  oracle.  Obviously,  an  oracle  can  be  utilized  to  perform  a  binary  search  over 
the  polyhedron  Q.  However,  in  order  to  attain  an  exact  solution  within  time  bounds 
that  depend  only  on  d ,  T ,  and  k,  we  use  the  oracle  in  a  more  sophisticated  way. 
The  number  of  hyperplane  queries  needed  by  the  algorithm,  and  hence  the  number 
of  oracle  calls,  is  bounded  by  the  number  of  comparisons  performed  by  A.  We  later 
discuss  applying  the  multi-dimensional  search  technique,  what  allows  us  to  do  even 
better.  By  exploiting  the  parallelism  of  A,  the  number  of  oracle  calls  can  in  some 
cases  be  reduced  to  a  polylogarithm  of  the  number  of  hyperplane  queries. 

The  function  g  is  a  concave  piecewise  linear  mapping.  Concave  functions  have 
the  property  that  it  can  be  effectively  decided  which  side  of  a  given  hyperplane  H0 
contains  the  maximum  of  the  function.  If  the  domain  of  g  does  not  intersect  Ho,  then 
the  answer  is  the  side  of  Ho  which  contains  the  domain  of  g.  Otherwise,  the  decision 
can  be  made  by  considering  a  neighborhood  of  the  maximum  of  the  function  relative 
to  Ho,  searching  for  a  direction  of  ascent  from  that  point.  This  principle  is  explained 
in  detail  in  [47]. 

For  a  hyperplane  Ho  C  R?,  we  wish  to  decide  on  which  side  of  Ho  the  set  relint  A 
lies.  By  solving  a  linear  program  with  d  variables  and  k  + 1  constraints,  we  determine 
whether  or  not  ff0  H  2  =  0,  and  if  so,  we  determine  which  side  of  Ho  contains  Q.  It 
follows  from  [47]  that  this  can  be  done  in  0(k)  time.  If  H0  D  Q  ^  0,  then  the  oracle 
problem  solves  the  original  problem,  when  g  is  restricted  to  H0.  If  g  is  unbounded  on 
Ho  the  oracle  reveals  that.  If  A  =  0,  or  if  relint  A  is  either  contained  in  Ho  or  extends 
into  both  sides  of  Ho  (i.e.,  Ho  H  relint  A  ^  0),  then  we  find  A  €  Ho  fl  relint  A  and  the 
oracle  will  actually  solve  Problem  4.2.8. 
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Problem  4.2.9  Given  are  a  set  Q  =  F\  fi  •  •  •  r  Fk,  a  piecewise  affine  algorithm  A 
which  evaluates  a  concave  function  g  :  Q  — >  R,  and  a  hvperplane  H0  in  Rd.  Do  as 
follows: 

i.  If  Q  Ho  =  0,  recognize  which  of  the  two  halfspaces  determined  by  Ho  contains 
Q.  Otherwise, 

ii.  recognize  whether  or  not  g  is  bounded  on  Ho-  If  it  is,  then 

iii.  find  A  6  H0  fl  relint  A  if  such  A  exists,  and  solve  Problem  4.2.8  relative  to  g. 
Otherwise,  if  H0  fl  relint  A  =  0,  then 

iv.  recognize  which  of  the  two  halfspaces  determined  by  Ho  has  either  a  nonempty 
intersection  with  relint  A,  or  has  g  unbounded  on  it. 

A  procedure  for  solving  Problem  4.2.9  will  be  called  an  oracle  and  the  hyperplane  Ho 
will  be  called  the  query  hyperplane.  Problem  4.2.8  is  solved  by  running  a  modification 
of  the  algorithm  A,  where  additions  and  multiplications  are  replaced  by  vector  opera¬ 
tions  and  comparisons  are  replaced  by  hyperplane  queries.  Problem  4.2.9  is  solved  by 
three  recursive  calls  to  instances  of  Problem  4.2.8  of  the  form  (AB ,  QH),  (AB,  Qjf), 
where  /3  €  R*,  ajid  H  is  a  hyperplane  (see  Definitions  4.2.5  and  4.2.6).  Note  that  these 
algorithms  compute,  respectively,  the  functions  gB  :  QB  —*  R,  gB  :  QB  —*  R,  where 
QB  and  QB  are  subsets  of  R?~l .  Hence,  the  recursive  calls  are  made  to  instances  of 
lower  dimension. 

In  Section  4.5  we  propose  Algorithm  4.5.2  for  Problem  4.2.8.  The  algorithm 
executes  calls  to  the  oracle  problem  (Problem  4.2.9)  relative  to  g.  An  algorithm  for 
the  oracle  problem  is  given  in  Section  4.3.  A  call  to  the  oracle  is  costly.  Therefore, 
one  wishes  to  solve  many  hyperplane  queries  with  a  small  number  of  oracle  calls.  In 
Section  4.4  we  discuss  the  multi-dimensional  search  technique  (introduced  in  [47]). 


4.3  Hyperplane  queries 

For  a  hyperplane  H  C  R?}  we  solve  Problem  4.2.9  for  g  relative  to  H . 


4.3.  HYPERPLANE  QUERIES 
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Theorem  4.3.1  Problem  f.2.9  can  be  reduced  to  the  problem  of  solving  three  in¬ 
stances  of  Problem  4'2.8  on  functions  defined  on  an  intersection  of  at  most  k  closed 
halfspaces  m  The  time  complexity  of  the  additional  computation  is  0(d3). 

Proof:  We  solve  Problem  4.2.8  with  the  function  gH ,  where  H  =  {A  £  Rd  aT A  = 
a}.  If  g  is  unbounded  on  H,  then  this  fact  is  detected;  otherwise,  suppose  A^  is  in 
the  relative  interior  of  the  set  of  maximizers  of  g(A)  subject  to  A  €  H ,  and  we  get 
the  collection  Let  =  <7(A^).  We  wish  to  recognize  whether  A^  is  also  a 
relative  interior  point  of  the  set  of  global  maxima  (i.e. ,  relative  to  Rd).  If  not,  then 
we  wish  to  decide  whether  for  all  A’  £  Rd  such  that  g(A')  >  <7(A^),  necessarily 
aT Aw  >  a,  or  whether  for  all  of  them  aT A"  <  a.  These  are  the  two  possible 
cases.  Consider  the  function  gX( o>.  We  solve  Problem  4.2.8  on  two  restrictions  of 
gxi o)  to  hyperplanes  (see  Proposition  4.2.6),  where  in  one  case  it  is  restricted  to 
iifri)  =  {A  |  aT A  =  a  —  1),  and  in  the  other  to  H =  {A  |  aT A  =  a  -f  1}.  Note 
that  the  domains  €  {-1, 1})  are  ( d - l)-dimensional.  Denote  the  respective 

optimal  values  of  gffi  by  t^  ( 6  £  {-1,1}),  and  let  C6  be  the  respective  minimal 
weak  approximations.  Only  one  of  the  optimal  values  can  be  greater  than 

If  this  is  the  case,  or  if  one  of  equals  and  the  other  is  smaller, 

then  the  side  of  the  hyperplane  that  contains  relint  A  is  determined.  Otherwise, 
if  both  values  are  less  than  or  both  values  are  equal  to  t^°\  then  t^  is  the  global 
optimal  value.  In  the  latter  case  A^  £  relint  A.  It  follows  from  Proposition  4.2.7 
that  the  pieces  of  g  active  in  a  minimal  weak  approximation  have  the  value  t ^  at 
A^.  Thus,  a  minimal  weak  approximation  of  the  function  gX(o)  is  a  minimal  weak 
approximation  of  g.  It  follows  from  analysis  done  in  Chapter  3  that  by  using  0(d °) 
operations  we  can  construct  a  minimal  weak  approximation  of  gX( <».  Furthermore, 
the  number  of  pieces  involved  in  a  minimal  weak  approximation  is  at  most  2d.  | 

As  an  example,  consider  an  application  of  the  algorithm  described  in  the  proof 
to  decide  on  which  side  of  the  hyperplane  H  =  {2}  the  function  p(A)  =  min{A/5  + 
2,  — 4A  +  12.5}  is  maximized  (see  Figure  4.2).  Note  that  maximizing  a  function  /  : 
R  — *  R  on  a  hyperplane  corresponds  to  evaluating  it  at  a  single  point.  Therefore,  the 
maximum  value  of  g  on  H  is  2.4.  The  algorithm  considers  the  restriction  gj  =  A/5+2, 
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Figure  4.2:  Example:  hyperplane  query  at  H  —  {2} 

and  maximizes  it  on  the  hyperplanes  H M  =  {1}  and  =  {3}.  The  corresponding 

maxima  are  t =  2.2  and  =  2.6,  and  hence,  the  algorithm  concludes  that  the 
maximizers  of  g  are  contained  in  the  halfspace  {A  €  #|A  >  2}.  Observe  that  this 
conclusion  could  not  have  been  made  if  the  algorithm  considered  the  values  of  g, 
rather  than  the  values  of  the  restriction  <72,  at  the  hyperplanes  {1}  and  {3}. 


4.4  Employing  multi-dimensional  search 

The  definitions  and  propositions  stated  in  this  section  appeared  in  Section  3.3.2.  They 
are  presented  here  to  allow  for  an  independent  reading  of  this  chapter.  For  proofs, 
the  reader  is  referred  to  Section  3.3.2.  The  multi-dimensional  search  problem  was 
defined  and  used  in  [47]  for  solving  linear  programming  problems  in  fixed  dimension. 
In  this  section  we  employ  it  to  achieve  better  time  bounds. 

Definition  4.4.1  We  define  a  partial  order  on  i2*\{0},  relative  to  a  concave  function 
g  :  Q  —*  R,  where  Q  C  R?~x  is  a  nonempty  polyhedral  set.  For  any  pair  of  distinct 
vectors  a1;  a2  €  R?,  denote 


H  =  tf(ai,a2)  =  {A  €  R?'1  :  afA  =  afA}  . 


4.4.  EMPLOYING  MULTI-DIMENSIONAL  SEARCH 


Figure  4.3:  An  example  where  (4,4,5)  <a  (2,5,7) 

If^  is  unbounded  on  ff(ai,a2)  or  if /f (01,02) Hrelint  A  ^  0,  then  we  write  ai  <>A  a2. 
Otherwise,  g  can  be  unbounded  on  at  most  one  of  the  open  halfspaces  determined 
by  H,  and  also  relint  A  can  intersect  at  most  one  of  these  open  halfspaces.  If  g  is 
undefined  on  II  (i.e.,  Q  D  H  =  0),  then  Q  is  contained  in  one  of  these  halfspaces. 
We  denote  aa  <A  a2  (respectively,  Oi  >A  o2)  if  there  exists  a  A  6  relint  A  such  that 
of  A  <  a\ A  (respectively,  of  A  >  of  A),  in  which  case  the  same  holds  for  all  these 
A’s,  or  if  g  is  unbounded  on  the  halfspace  determined  by  the  inequality  of  A  <  a^A 
(respectively,  of  A  <  o^A).  See  Figure  4.3  for  an  example.  We  also  use  the  notation 
<P  for  a  similar  partial  order  relative  to  any  set  P. 


Problem  4.4.2  Given  are  finite  sets  Ai,...,Ar  of  nonzero  vectors,  where  Ai  = 
{aj, . . . ,  a* .}  (alj  €  R?)  and  s  =  We  wish  either  to  find  a  minimal  element, 

with  respect  to  the  partial  order  <A,  in  each  of  the  sets  Ai,  or  (if  we  encounter  two 
incomparable  elements)  to  reduce  the  problem  to  a  lower  dimension.  More  specifically, 
we  need  to  do  either  one  of  the  following: 

i.  Find  a  collection  of  closed  halfspaces  whose  intersection  P  contains  relint  A, 

and  indices  1  <  m*  <  (»  —  1, . . .  ,r)  as  follows.  For  every  1  <  t  <  r  and  every 

1  <  j  <  *i,  j  ^  we  have  <A  aj  and  a j*.  <P  a). 

ii.  Find  a  hyperplane  H  such  that  either  g  is  unbounded  on  H  or  H  n relint  A  ^  0. 
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Proposition  4.4.3  Problem  4-4-2  can  be  solved  using  0(~f(d  —  l)log.s)  oracle  calls 
plus  additional  computation  which  can  be  performed  in  either 

i.  0(i{d  —  1 )  log 2  5 )  parallel  time  on  0(s)  processors,  or 

ii.  0{~i{d  —  l)slogi)  sequential  time. 

The  function  'y(d)  arises  from  the  multi- dimensional  search  [41]-  It  follows  from  [5, 
19]  that  7 (d)  =  3°^\ 

4.5  The  algorithm 

The  algorithm  described  below  solves  Problem  4.2.8.  It  finds  a  vector  A”  G  relint  A, 
unless  g  is  unbounded.  It  also  returns  a  collection  C  of  pieces  of  g  whose  minimum 
envelope  Lc  is  a  minimal  weak  approximation  of  g.  The  number  of  vectors  in  C  is  at 
most  2d. 

Definition  4.5.1  For  a  piecewise  affine  algorithm  A,  we  define  the  corresponding 
lifted  computation.  The  lifted  computation  is  a  run  of  the  algorithm  on  a  set  of  inputs. 
The  computation  is  done  symbolically  on  linear  functions  instead  of  on  scalars.  It 
follows  the  path  on  the  computation  tree  of  A  that  corresponds  to  input  vectors 
which  are  The  additions  and  scalar  multiplications  are  trivially  generalized  to 

operation!  ear  functions.  When  a  comparison  is  done  between  f\ A  and  f2 A,  it 
is  resolved  according  to  the  partial  order  <a-  We  compute  the  hyperplane 
and  solve  Problem  4.2.9  (hyperplane  query)  relative  to  H.  The  hyperplane  query 
decides  whether  or  not  the  vectors  are  comparable.  If  they  are,  it  decides  whether 
fi  <a  ft-  H  f\  <>A  /a>  then  the  lifted  computation  halts  since  an  oracle  call 
resulted  in  a  solution  to  Problem  4.2.8.  Otherwise,  the  resolved  hyperplane  query 
tells  us  which  of  the  halfspaces  defined  by  contains  the  set  relint  A,  and 

the  comparison  is  resolved. 

Sets  of  independent  comparisons  performed  by  A  correspond  to  sets  of  indepen¬ 
dent  hyperplane  queries.  Recall  from  Section  4.4  that  a  set  of  independent  hyperplane 
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queries  can  be  solved  by  performing  a  logarithmic  number  of  ‘'oracle''  calls.  The  lifted 
computation  maintains  a  set  7i  of  closed  halfspaces  which  is  initially  empty.  When¬ 
ever  an  oracle  call  is  executed  the  resulting  halfspace  is  added  to  H. 

Algorithm  4.5.2  [Find  a  vector  A  £  relint  Afl] 

Step  1.  Run  the  lifted  computation,  collecting  into  TL  all  the  halfspaces  resulting 
from  oracle  calls  where  comparisons  are  resolved.  If  the  computation  halts, 
then  some  comparison  is  not  resolved  but  a  global  solution  is  found,  so  stop. 
Otherwise,  denote  by  m  =  (mi, . . . ,  m^+  :)r  £  Rd+1  the  piece  of  g  that  corre¬ 
sponds  to  the  computation  path  followed. 

Step  2.  Denote  by  P  the  intersection  of  the  halfspaces  in  H. 

i.  Compute  A”  €  relint  P  C\Q.  This  amounts  to  a  linear  programming  prob¬ 
lem  with  d  variables  and  \H\  constraints,  and  hence  it  can  be  solved  in 
0( \H\)  sequential  time  [47].  Note  that  the  size  of  H  is  bounded  by  the 
number  of  oracle  calls. 

ii.  If  L{m}  is  not  constant  on  R?,  that  is,  not  all  of  m1;  m2, . . . ,  mj  equal  zero, 
then  g  is  unbounded.  Otherwise, 

iii.  consider  g(X~)  =  mj+  j.  The  function  L{m}  is  a  weak  approximation  of  g, 
and  P  =  A.  Hence,  A"  £  relint  A.  Output  A"  and  C  =  {m}. 


4.6  Correctness 

If  an  oracle  call  results  in  a  solution  during  Step  1  of  Algorithm  4.5.2,  then  correctness 
follows  by  induction  on  the  dimension.  We  now  assume  that  no  oracle  call  resulted  in 
a  solution  during  Step  1.  In  this  case,  a  collection  H  of  closed  halfspaces  is  obtained. 
Recall  that  if  an  oracle  call  on  a  hyperplane  H  did  not  result  in  a  solution,  then  the 
halfspace  F  returned  has  the  following  properties:  (i)  if  the  function  g  is  bounded 
then  AC  F  but  A  £  (ii)  if  the  function  g  is  unbounded,  then  it  must  be  bounded 
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on  the  hyperplane  H ,  and  unbounded  on  the  halfspace  F.  Let  P  be  the  polyhedron 
P  —  fVgft  F •  ^  follows  that  if  g  is  bounded  then  P  D  A,  and  if  g  is  unbounded 
then  it  must  be  bounded  outside  and  on  the  boundary  of  P.  Note  that  P  must  be  of 
full  dimension  (dim  P  —  d ),  for  if  not,  then  it  must  be  contained  in  one  of  the  query 
hvperplanes,  which  contradicts  the  previous  statement. 

Observe  that  for  all  pairs  aa,a2  of  vectors  compared  by  the  lifted  computation, 
one  of  the  following  must  hold:  either  a 2  <a  and  a2  <p  a2,  or  a2  <a  a2  and 
a2  <p  a2.  The  latter  is  obvious  when  we  call  the  oracle  to  resolve  each  hyperplane 
query,  and  it  is  easy  to  see  that  it  still  holds  when  we  employ  the  multi-dimensional 
search  technique  (see  Problem  4.4.2  and  Proposition  4.4.3)  and  solve  these  hyperplane 
queries  by  a  smaller  number  of  cracle  calls.  Thus,  the  piece  m  (the  maximizer)  found 
by  the  lifted  computation  must  satisfy  m  <\  c  and  hence  m  <p  c  for  all  pieces  c  of 
g.  It  follows  that  g( A)  =  mT A  for  all  A  €  P.  Thus,  g  is  unbounded  if  and  only  if 
L{my  is  not  constant,  and  the  correctness  of  step  ii  follows.  To  show  the  correctness 
of  step  iii  assume  that  £{m}  is  constant,  and  thus  g  —  m^+1  for  all  A  €  P.  Since 
P  D  A  we  have  P  =  A.  It  follows  that  A“  €  relint  A,  aff  A  =  R?,  and  £{m}  is  a 
minimal  weak  approximation  of  g. 


4.7  Complexity 


Consider  the  algorithm  A.  Suppose  that  the  C(A)  comparisons  performed  by  A  can 
be  divided  into  r  phases,  where  Ci  independent  comparisons  are  performed  during 
phase  t  (i  =  1, . . .  ,r).  It  follows  from  Proposition  4.4.3,  that  the  lifted  computation 
can  be  implemented  in  such  a  way  that  it  performs  7 (d)  [log  Cj]  oracle  calls. 

It  follows  from  Theorem  4.3.1  that  each  oracle  call  involves  three  recursive  calls  to 
instances  of  Problem  4.2.8  of  lower  dimension.  The  piecewise  affine  algorithms  that 
correspond  to  these  instances  have  the  same  number  of  comparisons  as  A ,  divided 
into  phases  in  the  same  way,  and  0(d)  times  more  operations.  Thus,  the  total  number 


4.8.  PARAMETRIC  EXTENSIONS  OF  PROBLEMS 


81 


of  operations  needed  for  the  lifted  computation  is 

J>g  )J . 

i=l 

The  number  of  parallel  phases  needed  in  the  above  computation  is  bounded  by 
the  product  of  the  number  of  phases  of  the  algorithm  A  with  £I=i  [log  C i]  )d.  If 
the  algorithm  A  is  inherently  sequential,  then  the  total  number  of  operations  is 

0{kT{A)C(A)d). 

4.8  Parametric  extensions  of  problems 

The  technique  described  in  this  chapter  was  employed  in  Chapter  3  to  get  algorithms 
for  the  parametric  extensions  of  the  minimum  cycle  and  the  minimum  cycle-mean 
problems.  This  technique  can  be  applied  to  a  variety  of  other  problems,  where  we 
consider  a  strongly  polynomial  algorithm  for  a  problem  and  obtain  a  strongly  poly¬ 
nomial  algorithm  for  a  parametric  extension  of  the  problem  (when  the  number  of 
parameters  is  fixed).  We  state  the  conditions  where  this  technique  is  applicable  and 
present  applications. 

Definition  4.8.1  [Parametric  extensions] 

i.  A  problem  S  :  V  — ►  R  is  a  mapping  from  a  set  V  of  instances  into  the  set  of 
real  numbers.  We  say  that  S(P)  is  the  solution  of  the  problem  for  the  instance 
P  €  V.  Suppose  that  every  instance  P  €  V  has  a  size  |(P||  associated  with  it. 
The  size  of  an  instance  is  not  necessarily  defined  to  be  the  number  of  bits  in  its 
representation.  It  may  be  any  natural  parameter  (for  example,  the  number  of 
edges  in  a  weighted  graph). 

ii.  Let  A  be  an  algorithm  that  computes  S(P).  Denote  by  7U(P)  the  number  of 
elementary  operations  the  algorithm  performs  on  the  instance  P.  The  algorithm 
A  is  polynomial  if  T*(P)  =  0(p(||P||))  for  some  polynomial  p(«). 
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iii.  A  d-parametric  extension  Pd  =  (M.,  Q )  of  V  is  defined  as  follows,  where  Q  C  Rd 
is  a  polyhedron  given  as  an  intersection  of  k  halfspaces,  and  M.  :  Q  — *  V  is  a 
mapping  from  points  A  €  Q  to  instances  of  V.  The  extension  Pd  corresponds 
to  a  subset  of  instances  {.M(A)  |  A  t  Q]  C  V-  We  refer  to  -M(A)  €  V  as 
the  instance  of  V  induced  by  A.  For  an  extension  Pd,  we  define  g  :  Q  —*  R 
as  a  mapping  from  vectors  A  6  Q  to  the  solution  of  the  corresponding  induced 
instance  g(\)  =  S(.M(A)).  A  solution  of  the  parametric  extension  Pd  is  defined 
as  follows.  Consider  the  maximum  of  g( A).  If  it  is  finite,  a  solution  consists 
of  the  maximum  and  a  vector  A  £  Rd  that  belongs  to  the  relative  interior  of 
the  set  of  vectors  which  maximize  S.  Formally,  if  Q  is  empty  or  if  S(.M(A))  is 
unbounded  on  Q,  these  facts  are  recognized.  Otherwise,  a  pair  (m,  A")  €  RxRd, 
where  m  =  maxAeg#(A),  and  €  relint{A  |  g( A)  =  m}  is  computed.  We 
denote  T  =  max^gc  TU(A4(A)). 

Theorem  4.8.2  Let  S  :  V  — >  R  be  a  problem  in  the  sense  of  Definition  4-8.1.  Let 
A  be  an  algorithm  that  evaluates  S,  and  let  Pd  =  (M,Q)  (where  |Q|  =  k)  be  a 
corresponding  parametric  extension.  We  assume  that 

i.  the  function  g  is  concave, 

ii.  the  mapping  M.  is  computable  by  a  piecewise  affine  algorithm  Am  (see  Defini¬ 
tion  4.2.2)  in  less  than  T  operations,  and 

iii.  the  combined  algorithm  which  computes  an  instance  .Am(A)  6  V  and  applies  A 
to  Am(A),  is  piecewise  affine. 

Denote  by  C  the  maximum  (over  A  €  Q)  number  of  comparisons  performed  by  the 
combined  algorithm.  Suppose  the  comparisons  can  be  divided  into  r  sets  of  sizes 
Ci, ...  ,CT  (C  =  C,)  such  that  the  algorithm  runs  in  r  phases,  where  Ci  inde¬ 

pendent  comparisons  are  performed  in  phase  i. 

Under  these  conditions,  the  d-parametric  extension  Pd  can  be  solved  using 
0(d)kT(^2  flog  Ci]  )d  operations,  where  /3(d)  =  3°^  . 
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Remark  4.8.3  In  the  above  formulation  we  defined  a  problem  as  a  mapping  into 
the  set  of  real  numbers  5  :  V  —*  R.  The  results  generalize  to  cases  where  the  range 
of  S  is  Rl  for  l  >  1  and  the  notions  of  maximum  and  concavity  of  g  are  defined  with 
respect  to  the  lexicographic  order  as  discussed  in  the  introduction. 

Below  we  present  some  applications  of  Theorem  4.8.2.  Additional  applications 
were  found  by  Norton,  Plotkin,  and  Tardos  [50]. 

Adding  variables  to  LP’s  with  two  variables  per  inequality  Linear  program¬ 
ming  problems  with  at  most  two  variables  in  each  constraint  and  in  the  objective 
function  were  shown  to  have  a  strongly  polynomial  time  algorithm  by  Megiddo  [46]. 
Lueker,  Megiddo  and  Ramachandran  [43]  gave  a  polylogarithmic  time  parallel  algo¬ 
rithm  for  the  problem  which  uses  a  quasipolynomial  number  of  processors.  The  best 
known  time  bounds  for  the  problem  are  presented  in  Chapter  5.  Cosares,  using  nested 
parametrization,  extended  Megiddo’s  strong  polynomiality  result  to  allow  objective 
functions  which  have  a  fixed  number  of  nonzero  coefficients.  This  result  can  be  fur¬ 
ther  extended  to  include  the  following.  For  a  fixed  d ,  we  consider  linear  programming 
problems  as  above,  but  we  allow  certain  d  additional  variables  to  appear  anywhere  in 
the  constraints  and  in  the  objective  function  without  being  “counted.”  This  problem 
is  a  d- parameter  extension  of  the  two  variables  per  constraint  problem,  where  the 
“parameters”  are  the  d  additional  variables.  For  each  choice  of  values  for  the  pa¬ 
rameters  we  have  a  corresponding  induced  system  with  two  variable  per  constraint. 
It  is  easy  to  verify  that  the  conditions  of  Theorem  4.8.2  hold.  Hence,  this  class  of 
problems  also  has  a  strongly  polynomial  time  algorithm,  and  a  polylogarithmic  time 
parallel  algorithm  which  uses  a  quasipolynomial  number  of  processors. 

Parametric  flow  problems  Theorem  4.8.2  was  applied  in  [10]  to  generate  strongly 
polynomial  algorithms  for  parametric  flow  problems  with  a  fixed  number  of  param¬ 
eters  and  to  some  constrained  flow  problems  with  a  fixed  number  of  additional  con¬ 
straints.  Complementing  results  showing  the  P-completeness  of  these  problems  when 
the  number  of  parameters  is  not  fixed,  were  also  given. 
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Chapter  5 

Linear  systems  with  two  variables 
per  inequality 


We  show  that  a  system  of  m  linear  inequalities  with  n  variables,  where  each  inequality 
involves  at  most  two  variables,  can  be  solved  in  0{mm?)  time,  and  using  randomiza¬ 
tion,  in  0(n3  +  mn)  expected  time.  Parallel  implementations  of  these  algorithms 
run  in  0(n )  time,  where  the  deterministic  algorithm  uses  0(mn)  processors  and  the 
randomized  algorithm  uses  0(n2  +  m)  processors. 


5.1  Introduction 

In  this  chapter  we  consider  the  following  class  of  linear  systems: 

Definition  5.1.1  A  TVPI  system  is  a  system  of  linear  inequalities  where  each  in¬ 
equality  involves  at  most  two  variables  (i.e.,  a  system  of  the  form  Ax  <  b  ,  where 
A  €  /2mX”  is  a  matrix,  b  €  iZ"1  is  a  real  vector,  and  each  row  of  A  contains  at  most 
two  nonzero  entries).  We  denote  the  number  of  inequalities  by  m  ,  and  the  variables 
by  *i, . . . ,  *„.  We  denote  by  F  =  {x  €  RP\ Ax  <  6}  the  set  of  feasible  solutions. 

A  TVPI  system  is  called  monotone  if  the  two  nonzero  entries  in  each  row  have  opposite 
signs.  See  Figure  5.1  for  an  example  of  a  TVPI  system. 
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-x  +  y  £0  x  +  z  £  2 

-z  +  y  £0  y  £0 


Figure  5.1:  An  example  of  a  TVPI  system  with  3  variables 


Our  goal  is  to  either  find  a  point  that  satisfies  all  the  inequalities  or  conclude  that 
no  such  point  exists.  The  structure  of  TVPI  systems  enables  us  to  obtain  specialized 
algorithms  that  are  faster  than  known  algorithms  for  solving  general  linear  systems. 
The  algorithms  given  here  can  also  be  adapted  to  find  a  solution  that  minimizes  or 
maximizes  a  specific  variable.  We  summarize  previous  work  on  solving  TVPI  sys¬ 
tems.  Shostak  [57]  characterized  the  set  of  solutions  of  TVPI  systems  and  gave  an 
algorithm  that  is  exponential  in  the  worst  case.  Nelson  [49]  gave  an  n°(logn)  algo¬ 
rithm.  Aspvall  and  Shiloach  [4]  and  Aspvall  [3]  proposed  algorithms  that  perform 
0(mn3I)  and  0(mn2I)  arithmetic  operations,  respectively,  where  I  is  the  size  of  the 
binary  encoding  of  the  problem.  Megiddo  [46]  proposed  the  first  strongly  polyno¬ 
mial  time  algorithm  for  the  problem,  which  performs  0(mn3  log  to)  operations.  The 
parallel  implementation  of  Megiddo’s  algorithm  runs  in  0(n3  log  m)  time  using  0{m) 
processors. 

The  algorithms  presented  here  improve  the  sequential  and  parallel  time  bounds. 
We  give  an  0  (mn2  (log  m  +  log2  n))  time  deterministic  algorithm,  which  has  a  paral¬ 
lel  implementation  that  runs  in  O  (n(log  m  +  log2  n))  time  using  0{mn )  processors. 
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An  additional  improvement  is  obtained  through  using  randomization:  we  give  an  algo¬ 
rithm  that  runs  in  0  (n3  log  n  —  mn(log5  n  —  log  m  log3  n) j  expected  time.  A  parallel 
implementation  runs  in  0  (n(log5n  —  log3nlogm)j  expected  time  using  0(n 2  -r  m) 
processors.  The  effort  involved  in  translating  these  algorithms  into  actual  programs 
is  about  the  same  as  in  the  previously  known  algorithms,  and  the  hidden  constants 
in  the  time  bounds  are  still  reasonable.  The  space  requirement  of  these  algorithms 
is  0(n2  t  m ).  Hochbaum  and  Naor  [32]  found  a  new  0(mn2  logm)  deterministic 
algorithm  for  the  problem.  Their  algorithm,  however,  runs  in  0(n2  log  m)  time  in 
parallel,  and  it  does  not  seem  possible  to  combine  it  with  the  randomized  approach 
to  yield  algorithms  with  better  expected  time. 

Section  5.2  gives  the  preliminaries.  In  particular  we  discuss  the  subproblem  of 
locating  a  value:  for  a  numerical  value  £  and  1  <  i  <  n,  decide  whether  a  TVPI 
system  with  the  addition  of  either  z,  >  £  or  z,  <  £  remains  feasible.  An  0(mn) 
algorithm  (O(n)  time  0(m)  processors  in  parallel)  for  locating  a  value  was  given  by 
Aspvall  and  Shiloach  [4]. 

In  Section  5.3  we  introduce  a  framework  for  solving  TVPI  systems  and  analyze 
the  running  time  of  a  deterministic  algorithm  that  relies  on  locating  values.  We  use 
this  framework  to  reduce  solving  TVPI  systems  to  solving  a  poly  logarithmic  number 
of  instances  of  the  more  general  subproblem  of  locating  a  pool  of  values :  for  numerical 
values  £i , . . . ,  choose  bounds  Si, . . . ,  sn,  where  Si  €  {®»  >  <  &},  such  that  the 

system  with  the  additional  constraints  Si,...,sn  remains  feasible.  Obviously,  a  pool 
of  values  can  be  located  by  n  sequential  applications  of  an  algorithm  that  locates  a 
single  value.  This  yields  an  0(mn2)  time  sequential  algorithm  which  in  parallel,  runs 
in  0(n2)  time  using  0(m )  processors. 

In  Section  5.4  we  present  better  algorithms  for  locating  a  pool.  We  first  give  an  al¬ 
gorithm  that  improves  the  parallel  running  time  to  0(n)  using  0(mn)  processors.  We 
also  give  an  overview  of  a  two  stage  0(mn)  time  randomized  algorithm.  Sections  5.5 
and  5.6  are  concerned  with  the  details  of  the  two  stages. 

In  Section  5.7  we  discuss  the  special  structure  of  monotone  systems.  Section  5.8 
contains  concluding  remarks. 
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Figure  5.2:  An  example  of  a  system  of  inequalities  and  the  associated  graph 

5.2  Preliminaries 


In  this  section  we  introduce  some  terminology,  notation,  and  discuss  the  properties 
of  TVPI  systems.  We  define  some  basic  constructs  and  operations.  We  also  present 
a  modification  of  an  algorithm  by  Aspvall  and  Shiloach  [4]  for  locating  values. 

5.2.1  The  associated  graph 

We  represent  a  TVPI  system  by  a  set  of  n  intervals  and  a  graph  with  2n  nodes  and  2m 
edges.  The  graph  is  a  natural  representation  of  the  system,  where  variable?  correspond 
to  vertices  and  two- variable  inequalities  to  edges  between  vertices  of  the  participating 
variables.  Directed  paths  in  the  graph  yield  new  inequalities.  For  example,  the  edges 
who  correspond  to  a  >3y-f2,y>2  —  z,  and  z  >  1  —  2w  constitutes  a  directed 
path  of  length  3  which  yields  the  inequality  x  >  5  +  6w.  A  formal  definition  of  the 
associated  graph  follows. 
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Definition  5.2.1  Suppose  we  are  given  a  TYPI  system  as  in  Definition  5.1.1.  With¬ 
out  loss  of  generality,  assume  that  the  inequalities  with  a  single  variable  ( bounds )  are 
summarized  in  the  form  of  intervals  5,  =  [a*,  (  — oo  <  <  b{  <  oc,  i  =  1, . . . ,  n). 

We  consider  a  directed  graph  G  =  (V,E)  as  follows.  For  each  variable  2,,  there  are 
two  vertices  in  G  associated  with  2,-,  namely,  V  =  V U  F  where  V  =  {r,  l  i  =  1, . . .  ,71} 
and  V  =  {t^j  i  =  1, . . .  ,  n}.  For  u  €  V,  we  define  u-1  €  V  as  follows.  If  u  =  v{  €  V., 
then  u-1  =  Vi,  and  if  u  =  v,  €  V,  then  u-1  =  v The  edges  of  G  are  associated  wnth 
the  inequalities  that  involve  exactly  two  variables  as  follows.  Each  such  inequality  is 
represented  by  two  edges,  where  each  edge  e  is  labeled  with  a  certain  linear  function 
fe.  The  edges  corresponding  to  an  inequality  of  the  form 

72{  <  axj  +  0 


are  as  follows: 

i.  If  a  >  0  and  7  =  1,  we  have  an  edge  e  =  (vj,  Vi)  labeled  fc(x)  —  ax  +  (3  and  an 
edge  e"1  =  (lU,Vj)  labeled  fe-i(x)  =  ^2  -  f . 

ii.  If  a  <  0  and  7  =  1,  we  have  an  edge  e  =  (y_j,  vt)  labeled  /e(*)  =  ax  +  0  and  an 
edge  e~l  =  {jU,Vj)  labeled  /*-»(*)  ==  ax  ~  a  • 

iii.  If  a  >  0  and  7  =  —1,  we  have  an  edge  e  =  (Vj,  tjJ  labeled  /e(z)  =  —ax  —  0  and 
an  edge  e-1  =  (ui,v3)  labeled  /e- 1(2)  =  —^2  — 

See  Figure  5.2  for  an  example  of  such  a  system  and  the  associated  graph.  See  Fig¬ 
ure  5.3  for  the  associated  graph  of  the  system  shown  in  Figure  5.1.  We  assume 
throughout  this  chapter  that  a  TVPI  system  is  given  by  the  associated  graph  and  set 
of  intervals. 

For  any  one-to-one  function  /  let  f~l  denote  the  inverse  function.  In  particular,  if 
f(x)  =  ax  +  0  and  a  ^  0,  then  /-1(sc)  =  ^2  —  |.  Note  that  for  all  e  €  E,  fe-i  =  f~l - 

Let  G  be  as  in  Definition  5.2.1.  A  linear  function  associated  with  an  edge  (u,w) 
corresponds  to  an  inequality  in  the  original  system:  suppose  that  u  €  {1^,  v,};  if 
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Figure  5.3:  The  associated  graph  of  the  TVPI  system  of  Figure  5.1 

w  =  Uj  the  inequality  is  *i  <  /.(*  i),  and  if  w  =  v_j  the  inequality  is  Xj  >  fe(xi)- 
We  define  the  linear  function  associated  with  a  directed  path  p  from  u  to  w.  The 
corresponding  two  variable  inequality  which  results  from  treating  the  path  as  an  edge 
(u,  w),  is  implied  by  the  original  system. 

Definition  5.2.2  Let  p  =  (ei, . . . ,  e*)  be  a  (directed)  path  in  G. 

i.  For  any  path  p,  we  define  a  linear  function  fp,  where  fp  =  /ejt  o  -  •  •  o  fC} .  Note 
that  if  both  ends  of  p  lie  either  in  V  or  in  V  then  fp  is  increasing.  Otherwise, 
fp  is  decreasing. 

ii.  We  denote  by  p“l  the  path  p  =  (e*1,. . . ,  ef 1).  Note  that  fp-i  =  f~l  and  hence 
the  two  paths  p,  p-1  correspond  to  the  same  inequality. 

Consider,  for  example,  the  directed  path  from  y  to  J  in  the  graph  of  Figure  5.3. 
The  linear  function  associated  with  this  path  is  /  =  2  —  y,  and  the  corresponding 
inequality  is  z  <  2  —  y. 

In  particular,  Definition  5.2.2  applies  to  cycles  (closed  paths)  starting  at  distin¬ 
guished  vertices.  Cycles  play  a  special  role  since  they  give  a  relation  that  involves  a 
single  variable,  from  which  a  bound  on  this  variable  can  be  deduced.  This  is  formal¬ 
ized  in  the  following  definition. 
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Definition  5.2.3  We  extend  the  definition  of  a  cycle  to  paths  starting  at  r  and 
ending  either  at  r  or  at  v_1. 

i.  Let  c  be  a  cycle  starting  at  one  of  the  vertices  v,  and  v,,  and  ending  at  a 
vertex  Vi  (resp.,  vj.  Let  fc  be  the  associated  linear  function.  The  bound  on  Xi 
implied  by  the  cycle  c  follows  from  the  inequality  x  <  /e(x)  (resp.,  x  >  /c(x)). 
Obviously,  the  implied  inequality  must  hold  for  all  feasible  points.  Note  that  if 
a  cycle  c  starts  at  V{  and  ends  at  v*  (resp.,  starts  at  and  ends  at  V{),  then  fc 
is  decreasing.  Hence,  c  implies  a  lower  bound  (resp.,  an  upper  bound)  on  x,. 

ii.  We  say  that  the  cycle  c  contradicts  a  value  &  of  x;  (resp.,  x,-)  if  the  bound 
implied  by  c  is  of  the  form  x<  >  a  (resp.,  x*  <  a)  whereas  (i  <  a  (resp.,  &  >  a). 

Consider  the  graph  of  Figure  5.2  and  the  cycle  which  consists  of  the  path  from  x 
to  x.  The  function  associated  with  this  cycle  is  f  —  — x  -f  2  and  the  bound  implied 
by  the  cycle  is  x  <  —  x  +  2  (i.e.,  x  <  1). 

5.2.2  Pushing  bounds  along  edges 

We  maintain  numerical  values  associated  with  the  vertices  of  G;  the  value  associated 
with  Vi  (resp.,  uj  is  denoted  by  x<  (resp.,  xj.  Intuitively,  x,-  and  x,  correspond  to 
upper  and  lower  bounds,  respectively,  on  the  value  of  the  variable  x,-.  We  say  that  £ 
is  tighter  than  fi  as  a  value  of  x<  (resp.,  xj,  if  ^  <  ft  (resp.,  (  >  fi).  We  introduce  the 
following  operation  which  considers  an  edge  (u,v)  and  updates  the  bound  associated 
with  v,  as  implied  by  the  inequality  associated  with  (u,v)  and  the  bound  associated 
with  u. 

Definition  5.2.4  A  push  through  an  edge  e  is  defined  as  the  following  operation: 

i.  If  e  =  (t?j , 4  nun{lj, /*(Sjf)}. 

ii.  If  e  =  (vj.vj,  ^  i-  max{if,  /«(*,)}• 
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iii.  If  e  =  (t'j.rJ,  ^  —  max^, /e(*j)}. 

iv.  If  e  =  ( Vj,Vi ),  Xi  —  min{xtl/e(xJ)}. 

A  push  is  said  to  be  essential  if  it  actually  modifies  the  value  of  x,  or  x,. 

Proposition  5.2.5  Suppose  x^  and  x;  (i  —  1, . . .  ,n)  are  initialized  to  any  values  and 
let  X  be  the  set  of  vectors  x  =  (xlf . . .  ,xn)T  which  (i)  satisfy  all  the  given  inequalities 
and  (ii)  <  X{  <  X{  (i  =  1, . . .  ,n).  The  set  X  is  invariant  under  pushes. 

Proof:  Consider  an}r  a:  €  A'  and  a  push  through  an  edge  e  =  (vj,Vi).  (The 
arguments  for  the  other  cases  are  similar.)  Since  x  is  a  feasible  solution  we  have 
x,  <  fe(xj).  Note  that  fe  is  an  increasing  function  when  both  ends  of  e  lie  either 
in  V  or  in  V.  Hence,  from  Xj  <  Xj,  it  follows  that  Xj  <  fe{xj).  Since  xt-  <  Xj,  we 
have  Xi  <  min{Xi,  fe(xj)}.  i 

It  is  easy  to  see  that  a  vector  x  solves  a  given  TVPI  system  if  and  only  if  (i) 
*»  €  5,-  (*  =  1, . . . ,  n)  and  (ii)  the  set  of  values  Xi  =  z^  =  X{  (t  =  1, . . . ,  n)  is  invariant 
under  pushes. 

We  define  a  new  operation  which  amounts  to  pushing  (see  Definition  5.2.4)  through 
all  the  edges  simultaneously.  This  operation  is  used  as  a  subroutine  in  algorithms 
presented  later  in  this  section. 

Definition  5.2.6  Consider  the  graph  G  with  some  set  of  values  at  the  vertices,  x*  = 
xj,  x,-  sSj(isl,...,»).  A  push  phase  on  G  is  assigning,  at  the  respective  vertices, 
the  set  of  values  xf+1,  xj+1  (i  =  1, . . .  ,n)  defined  as  follows: 

2?+1  <-  minjlf.^min  }  {/«(*?)  if  e  =  {Vu Vj),  or  /e(xJ)  if  e  =  {v^Vj) 
x^+1  s-  maxjxj.^max  }{/*(^)  if  c  =  (t U,Vj),  or  fe(x J)  if  e  =  (vi,vi) 

Consider  repeated  applications  of  push  phases.  The  initial  set  of  values  is  denoted 
by  =  *?,  *<  =  2?  (*  =  1,  •  •  •  ,n),  and  xf  and  xf  denote  the  respective  values  of  x,- 
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and  X{  after  the  termination  of  the  fc’th  push  phase. 

Consider  pairs  consisting  of  a  value  and  a  vertex,  and  (xj.rj)  (where  k  >  0 

and  1  <  j  <  n).  The  predecessor  of  a  pair  ((,  v )  is  the  pair  (£' ,  v')  such  that  e  =  (v1,  v) 
is  the  edge  through  which  the  essential  push  determined  the  value  and  £'  =  /e-1(£) 
is  the  corresponding  value  at  v'.  If  the  predecessor  is  not  uniquely  defined  we  choose 
arbitrarily  among  the  qualified  pairs.  A  pair  (£,v)  does  not  have  a  predecessor  if  and 
only  if  £  is  the  initial  value  at  v. 

The  essential  path  associated  with  a  pair  (( ,v )  consists  of  the  sequence  of  pairs 
(£ij  )>•••)  (&,  vt)  and  the  corresponding  sequence  of  edges  ei,...,e/_i  such  that 
0)  (6,r>»)  is  the  predecessor  of  (&+i,t\-+i)  and  e;  is  the  edge  through  which  the  essen¬ 
tial  push  occurs  (i  =  1, ...  ,1—  1),  (ii)  (&,  vi)  =  and  (iii)  does  not  have 

a  predecessor. 

It  is  easy  to  see  that  the  essential  paths  which  correspond  to  (if ,  t^)  and  (zf,t7,) 
(i  =  l, ...  ,n)  can  be  found  within  the  same  time  bounds  of  performing  £  push  phases, 
simply  by  keeping  track  of  all  essential  push  operations. 

We  extend  the  notion  of  a  push  through  an  edge  to  directed  paths.  Let  p  = 
(ex, ... ,  efc)  be  a  (directed)  path  in  G.  A  push  along  p  is  defined  to  be  the  composition 
of  k  successive  pushes,  through  the  edges  ei, . . . ,  e*. 


5.2.3  Properties  of  the  feasible  region 

Consider  the  bounds  on  the  variables  which  are  implied  by  directed  cycles.  We 
discuss  the  relation  of  these  bounds  to  the  feasible  region.  We  first  give  the  following 
definitions. 

Definition  5.2.7  i.  For  each  variable  X{,  let  C  [a»,  6»]  be  the  set  of  values 

which  are  not  contradicted  by  any  simple  cycle.  Note  that  the  two  cycles  c  and 
c_1  imply  the  same  bound.  It  follows  that  in  order  to  find  x~  (resp.,  xj),  it 
suffices  to  consider  cycles  ending  at  v,  (resp.,  vj. 
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ii.  Denote  by  .x^11,  x •nax •  C  [xt",xt"]  the  largest  interval  such  that  (1)  for  all  simple 
paths  p  from  a  vertex  Vj  (resp.,  Vj)  to  vt  we  have  x-nBX  <  /p(x“)  (resp.,  xflBX  < 
/p(x”)  )  and,  (2)  for  all  simple  paths  p  from  a  vertex  Vj  (resp..  Vj )  to  u,-  we  have 
>r  >  fp(Sj)  (resp.,  xf- >  /,(*")  ). 

iii.  An  interval  I  C  R  is  infeasible  with  respect  to  the  variable  x,  if  /n[xf“a,  x™BX]  = 
0.  A  value  X{  (resp.,  x^)  is  infeasible  if  x,  <  i™”  (resp.,  x{  >  xmBX). 

iv.  A  value  Xi  (resp.,  X;)  is  consistent  with  an  interval  [a,  b)  if  x,  >  b  (resp.,  x^  <  a). 

Remark  5.2.8  When  the  values  x^  and  x,"  (i  =  1, . . .  ,n)  are  given,  we  can  compute 
xf1111  and  x™BX  as  follows.  Initialize  the  values  at  the  vertices  to  be  x2  =  x^  and 
( i  =  l,...,n).  Perform  2n  push  phases.  It  follows  from  Definition  5.2.7 
part  ii  that  xf““  =  x”  and  x““  =  x?.  This  procedure  runs  in  0(mn )  time  and  is 
used  in  Megiddo’s  algorithm  [46]. 

For  example,  the  TVPI  system  of  Figure  5.2  has  [x*,x"j  =  [—1,1]  =  [xnun,  xmBX],  and 
[y’rjT]  =  [0)  7]  =  [ynun,ym“].  The  TVPI  system  of  Figure  5.3  has  [x‘,x“]  =  [z*,  z“]  = 
[-oo,+oo],  [y-,T]  =  [y““  3,““]  =  [0,1],  and  [x”*,  x““]  =  [2^,2““]  =  [0,2]. 

The  following  key  observation  is  due  to  Shostak: 

Proposition  5.2.9  [57]  If  the  system  is  feasible,  then~the  interval  [x^,  x[““]  is  the 
projection  of  the  set  of  solutions  on  the  Xi-axis.  Otherwise,  xf““  >  x™”  for  some 
1  <  t  <  n. 

The  proof  follows  from  considering  the  possible  structure  of  minimal  sets  of  in¬ 
equalities  which  imply  a  bound  on  a  variable.  Proposition  5.2.9  and  the  procedure 
described  in  Remark  5.2.8  assert  that  it  suffices  to  consider  all  simple  paths  and  cy¬ 
cles  in  the  associated  gxaph.  Shostak  presented  an  exponential  time  algorithm  that 
essentially  examines  all  directed  simple  cycles  [57]. 

The  following  corollary  states  that  if  two  bounds  are  feasible  separately  but  not 
simultaneously,  then  there  exist  a  simple  path  that  implies  an  inequality  which  asserts 
that  the  bounds  are  not  simultaneously  feasible. 
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Corollary  5.2.10  Let  S  be  a  set  of  TVPI  constraints,  and  si  £  {i;  >  a.  x,  <  a} 
s2  €  { Xj  >  0,  Xj  <  0}  be  two  bounds.  Let  u,  =  Vi  if  sj  is  an  upper  bound,  and  U{  =  r,- 
otherwise;  and  let  uj  =  Vj  if  s?  is  an  upper  bound,  and  Uj  =  v:  otherwise.  Suppose 
S  U  {sj},  and  S  U  {52}  are  feasible  systems,  but  S  U  {^1,52}  is  not  feasible.  There 
exist  a  simple  path  p  from  Ui  to  u~ 1  such  that  (i)  fp(a)  <  0,  if  s^  is  a  lower  bound, 
and  (ii)  fp(a )  >  0,  if  s2  is  an  upper  bound.  Moreover,  if  p  is  the  tightest  (simple) 
path  from  Ui  to  u~l  relative  to  =  a,  then  if  S2  is  an  upper  ( resp lower)  bound, 
Xj  —  fp(a)  is  the  smallest  (resp.,  largest)  value  of  Xj  subject  to  S  U 

Proof:  Suppose  that  both  Si,S2  are  upper  bounds  (similar  for  the  other  cases). 
Consider  bj  =  r™n  under  the  system  5  U  {si}  and  6'  =  x ““  under  the  system  S. 
We  have  b'-  <  0  and  bj  >  0.  By  definition,  x™*  is  either  (a)  implied  by  a  cycle 
or  a  cycle  and  a  path  (i.e.,  implied  only  by  inequalities  which  involve  exactly  two 
variables),  or  (b)  implied  by  a  single  variable  inequality  involving  some  variable 
Xj  and  a  (possibly  empty)  directed  simple  path.  Since  the  system  5  contains  the 
same  two  variable  inequalities  as  S  U  {fii},  bj  is  determined  by  a  system  of  type  (b) 
consisting  of  the  bound  si  and  a  simple  path  p.  1 

5.2.4  Characterizations  of  TVPI  polyhedra 

Proposition  5.2.9  characterizes  the  feasible  region  in  terms  of  the  associated  graph. 
We  characterize  polyhedra  which  comprise  sets  of  solutions  of  TVPI  systems  ( TVPI 
polyhedra).  Polyhedra  which  can  not  be  expressed  as  such  are  non- TVPI  polyhedra. 

Proposition  5.2.11  Consider  a  polyhedron  P  C  RT.  The  following  statements  are 
equivalent: 

i.  The  polyhedron  P  is  a  TVPI  polyhedron. 

ii.  For  a  bounded  convex  set  P'  =  PDB1 ,  where  B'  is  a  box,  denote  a*  =  minjcgp'  *i, 
bi  =  maxx€p'SCi  (i  =  1,. . .  ,n),  and  denote  by  B  =  X"_i[av, &«■]  the  tightest 
bounding  box  for  P'.  Every  P'  as  above  is  such  that  ±a+\b£P'. 
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x+y+z  = 1 

Figure  5.4:  An  example  of  a  non-TVPI  polyhedron 

Hi.  Every  set  S  of  bounds  (i.e.,  conditions  of  the  form  Xj  >  a,  Xi  <  a,  X{  =  a) 
has  the  following  property.  Denote  by  Bs  €  P"  the  set  of  all  feasible  vectors  for 
S.  P  fl  B  =  0  if  and  only  if  there  exist  a  set  S'  C  S,  where  |5'|  <  2  such  that 
P  n  Bs'  —  0. 

iv.  Same  as  property  Hi,  but  S  contains  only  equations. 

Proof:  We  first  show  that  property  i  implies  properties  ii— iv. 

i  =>  ii  Was  proved  by  Lueker,  Megiddo,  and  Ramachandran  [43]. 

i  =>  iii  Suppose  P  is  a  TVPI  polyhedron,  let  Sp  be  a  set  of  TVPI  constraints  which 
define  P  and  let  P{j}  =  x?“n,xfl"t  (t  =  1, ...  ,n)  be  the  corresponding  projec¬ 
tions.  Let  S  be  a  set  of  bounds  such  that  the  combined  TVPI  system  Sp  U  5 
is  infeasible.  Note  that  the  associated  graphs  of  Sp  and  Sp  U  5  are  identical. 
Denote  by  *£*“,  xf“*  (t  =  1, . . .  ,n)  the  bounds  defined  by  the  system  S  U  Sp. 
B.v  definition,  each  of  xf1^, xf1"  (t  =  l,...,n)  which  does  not  coincide  with 
the  respective  end  point  of  the  interval  P{j},  is  determined  by  a  single  bound 
from  S  and  a  simple  path.  Proposition  5.2.9  implies  that  the  system  Sp  U  5  is 
infeasible  if  and  only  if  for  some  variable  Xj,  xj““  >  xf“*.  Hence,  at  least  one  of 
the  two  bounds  xJB“,xf*“  does  not  coincide  with  an  end  point  of  P<.  Consider 
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the  bounds  S'  C  Sp  C  S  (j5'  <  2)  which  determine  xf“n  >  x™**.  Obviously, 
the  system  Sp  J  S'  is  infeasible. 

iii  =>  iv  Obvious. 

It  remains  to  prove  that  ii-iv  =>  i.  Note  that  properties  i-iv  are  invariant  under 
scalings  and  translations  of  coordinates. 

Figure  5.4  gives  an  example  of  a  non-TVPI  polyhedron.  It  is  easy  to  see  that  for 
this  polyhedron  property  ii  does  not  hold  (consider  the  bounding  box  0  <  x  <  1, 
0  <  y  <  1,  0  <  z  <  1),  and  property  iv  does  not  hold  (consider  S  —  {x  =  0,y  = 
0,2  =  0}). 

Assume  that  P  is  a  non-TVPI  polyhedron.  There  exist  at  least  one  face  F  such 
that  the  following  holds.  Let  A  €  i2mxn,6  €  i?1  be  such  that  {x  €  P”!  Ax  =  6}  is 
the  affine  hull  of  F.  Consider  the  process  of  eliminating  variables  from  the  system 
Ax  =  b,  until  no  two  equations  involve  the  same  variable.  At  least  one  of  these 
equations  involves  more  than  two  variables.  Consider  such  a  face  F.  Without  loss  of 
generality  (by  scaling  coordinates)  we  can  assume  that  the  equation  is  x*  =  1 
(where  l  >  3),  and  for  all  points  x  €  P,  Y%=  i  *«  —  1*  Denote  by  H  the  hyperplane 
H  =  {*|  *»  =  1}-  It  is  easy  to  see  that  the  construction  is  such  that  the 

projection  P'  C  H  of  P  on  H  is  full-dimensional,  that  is,  aff.P'  =  H.  Consider  a 
point  x  €  relint  P‘ . 

iv  =>  i  By  definition,  there  exist  a  small  enough  e  >  0  such  that  for  i  = 

there  exist  x‘  €  P  such  that  xj  =  Xj  —  (l  —  l)e,  and  x}  =  x,-  +  e  (j  =  1, . . . ,  i  — 
l,t  -f  1, . . .  ,t).  Consider  the  set  of  equations  S  =  {x,-  =  £j  +  e|  j  =  1, . . . 
Obviously,  there  is  no  point  in  P  which  satisfies  S.  On  the  other  hand,  there  is 
a  point  which  satisfies  every  subset  of  £  —  1  >2  constraints.  Hence,  property  iv 
does  not  hold  for  P. 

ii  =>  i  By  definition,  there  exist  a  small  enough  e  >  0  such  that  for  t  =  1 

there  exist  x’  €  P  such  that  x\  =  x<  +  (l  —  l)e,  and  x}  =  x,  —  e(jf  =  l,...,»  — 
1, »  + 1, . . . ,  l).  Consider  the  box  B  defined  by  the  intervals  [ij  —  e,  ij  +  {l—  l)e] 
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( i  =  1, . . .  ,£).  Since  x'  £  B  fl  P,  B  is  a  bounding  box  for  B  H  P.  However, 
there  is  no  x  €  P  for  which  Xj  =  ij  -j-  [l  —  2)e  (j  =  1, . . .  ,£).  It  follows  that 
property  ii  does  not  hold  for  P. 


l 


5.2.5  Locating  values 

We  discuss  two  procedures  which  are  used  later  in  the  paper  and  are  based  on  repet¬ 
itive  applications  of  push  phases  and  examining  essential  paths. 

One  of  the  procedures  solves  the  following  problem. 

Problem  5.2.12  [Locate  a  value]  For  a  given  TVPI  system,  a  number  £  and  a  vari¬ 
able  Xi,  locate  £  with  respect  to  the  interval  [xf00,  x]n,lx],  that  is,  decide  whether  (i) 
£i  <  sf®  (ii)  (i  >  *f“,  or  (iii)  (i  €  {*?“  *f*]. 

We  refer  to  solving  Problem  5.2.12  as  locating  the  value  Aspvall  and  Shiloach  [3, 
4]  proved  the  following: 

Proposition  5.2.13  Problem  5.2.12  can  be  solved  in  0(mn)  operations. 

Aspvall  and  Shiloach  utilized  this  result  to  solve  TVPI  systems:  by  solving  in¬ 
stances  of  Problem  5.2.12  their  algorithm  conducts  a  binary  search  which  finds  a 
point  (i  €  [sf"11,  xr~]  (1  <  i  <  n).  They  showed  that  finding  such  a  point  &  can  be 
done  in  0(mnl)  operations  where  I  is  the  number  of  bits  in  the  binary  representation 
of  the  input.  This  yielded  their  0(mna7)  algorithm  for  TVPI  systems:  the  algorithm 
performs  n  steps,  where  in  step  i  a  point  &  6  [xfun,xfax]  is  found,  and  the  equation 
Xi  =  is  added  to  the  system.  If  the  original  system  is  feasible,  then  £  €  F. 

In  Section  5.3  we  present  a  0{mn7)  deterministic  algorithm  for  solving  TVPI 
systems,  which  is  based  on  locating  values  in  the  time  bounds  stated  in  Proposi¬ 
tion  5.2.13.  The  material  contained  in  the  remainder  of  the  current  section  is  needed 


98 


CHAPTER  5.  SYSTEMS  WITH  TWO  VARIABLES  PER  INEQUALITY 


for  achieving  the  randomized  bound  of  0(n3  —  mn)  and  improving  the  paraUel  com¬ 
plexity  of  solving  TYPI  systems.  We  present  and  analyze  an  algorithm  for  locating 
values  which  is  also  applicable  to  the  reveal  problem  which  is  defined  later.  The 
underlying  computation  amounts  to  O(n)  applications  of  push  phases  where  essential 
paths  are  maintained. 

Certificates  of  infeasibility:  Suppose  an  interval  I  =  [a.  b]  is  infeasible  with  re¬ 
spect  to  X{  (equivalently,  either  [—00,6]  or  (a,  00]  is  infeasible).  It  follows  from  Propo¬ 
sition  5.2.9  that  there  exist  a  vertex  w,  a  simple  path  p  from  u  €  to  w,  and 

a  disjoint  simple  cycle  starting  at  w  such  that  the  bound  on  w  implied  by  c  and  the 
path  p  produce  a  bound  on  xt-  which  does  not  intersect  I.  The  path  p  and  the  cycle 
c  form  a  minimal  subsystem  which  implies  the  infeasibility  of  I.  Note  that  the  cycle 
c  may  be  empty  in  which  case  the  infeasibility  follows  from  the  path  p  and  a  bound 
on  the  variable  associated  with  w.  We  refer  to  such  a  system  as  a  certificate  for  I. 
If  c  starts  and  ends  at  u,  we  say  that  the  certificate  is  closed.  If  c  ends  at  u_1  or 
c  is  empty,  we  say  that  the  certificate  is  open.  When  we  refer  to  a  certificate,  we 
interchangeably  mean  the  set  of  edges  E'  =  pU  c  ,  the  set  E'~l  of  the  reversed  edges, 
or  the  corresponding  set  of  inequalities. 

Classifying  values:  We  classify  values  of  variables  according  to  the  types  of  their 
certificates.  Consider  an  interval  I  with  respect  to  x,-.  If  I  has  a  closed  certificate  we 
say  that  I  is  strongly  infeasible.  If  only  open  certificates  exist,  we  say  that  I  is  wealdy 
infeasible.  Otherwise,  if  no  certificate  exists,  I H  [*?“*,  xf“*]  ^  0  and  we  say  that  I  is 
feasible. 

In  particular  this  definitions  apply  when  I  consists  of  a  single  point.  For  1  <  *  <  n, 
we  denote  by  [xfun",xj“x-]  the  set  of  all  feasible  and  wealdy  infeasible  values  of  Xj. 
Note  that  xfun‘,  if1***  (1  <  t  <  n)  are  independent  of  the  single  variable  inequalities. 
We  extend  the  definitions  of  the  concepts  above  from  intervals  to  values  at  vertices 
of  G:  A  property  (having  a  closed/open  certificate,  weak/strong  infeasibility)  holds 
for  a  value  £  at  a  vertex  (resp.,  Vi)  if  it  holds  for  the  interval  [£,  00]  (resp.,  [—00,  £]) 
with  respect  to  x,-. 
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We  present  two  corollaries  of  Propositions  5.2.5  and  5.2.9.  Consider  a  feasible 
TYPI  system  and  the  associated  graph.  Suppose  a  sequence  of  push  phases  is  per¬ 
formed. 

Corollary  5.2.14  If  the  values  at  the  vertices  are  initially  consistent  (see  Defini¬ 
tion  5.2.7),  they  remain  consistent  after  any  number  of  push  operations. 

Corollary  5.2.15  Consider  a  vertex  u  =  y±  (resp.,  u  =  V{)  and  a  value  £  =  (resp., 
£  =  *{)  for  some  Let  (£j>vj)  0  =  !>•••. k  <  *)>  where  (£k,vk)  =  (£,u),  be  the 
pairs  comprising  the  essential  path  (see  Definition  5.2.6)  associated  with  (£,u).  If  £ 
is  infeasible  (resp.,  strongly  infeasible)  at  u,  then  each  of  the  pairs  (£j,Vj)  (1  <3  <k) 
is  such  that  £j  is  an  infeasible  (resp.,  strongly  infeasible)  value  ofvj. 

Consider  an  essential  path  which  contains  a  cycle  (a  vertex  appears  more  than 
once).  The  following  proposition  enables  us  to  extract  information  from  the  cycle.  It 
states  that  either  all  values  following  the  start  pair  of  the  cycle  are  consistent  (the 
path  does  not  add  information)  or  all  values  preceding  the  last  pair  of  the  cycle  are 
strongly  infeasible  (an  infeasible  value  is  detected).  Moreover,  by  considering  the 
updating  cycle  we  can  determine  which  of  the  two  situations  occurs. 

Proposition  5.2.16  Let  £  be  a  value  at  u  =  v<  (resp.,  u  =  y Suppose  a  cycle  c, 
which  starts  and  ends  at  u,  is  such  that  fe{£)  <  £  (resp.,  fc{£)  >  £)•  Then,  either 
(i)  £  is  strongly  infeasible  at  u  and  the  cycle  c  is  a  closed  certificate,  or  (ii)  if  F  ^  to, 
£  is  consistent  at  u. 

Proof:  The  bound  implied  by  the  cycle  c  is  fe(y )  >  y  (resp.,  fc(y)  <  y).  It  suffices 
to  show  that  it  holds  either  (i)  only  for  values  y  such  that  y  >  £,  or  (ii)  only  for 
values  y  such  that  y  <  £.  The  function  fc(y)  —  y  is  linear  and  thus  can  change  sign 
only  once.  It  cannot  be  a  positive  (resp.,  negative  )  constant,  since  this  contradicts 
/e( £)  being  tighter  than  £.  If  the  function  is  a  negative  (resp.,  positive)  constant, 
the  cycle  implies  that  the  system  is  infeasible  (and  therefore  £  is  contradicted  by 
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the  cycle).  Otherwise,  consider  the  value  y‘  such  that  fc(y‘ )  =  ym .  It  is  immediate 
that  the  bound  implied  by  c  and  the  number  £  are  on  opposite  sides  of  y~.  § 

The  following  algorithm  is  applied  to  a  set  of  values  if,  if  (for  1  <  i  <  n).  The  al¬ 
gorithm  performs  a  sequence  of  push  phases,  and  keeps  track  of  essential  paths.  When 
the  algorithm  detects  a  non  simple  essential  path,  it  either  terminates  or  discards  the 
path.  If  the  algorithm  terminates  as  a  result  of  such  a  path,  a  closed  certificate  is 
found  for  one  of  the  initial  values.  A  vertex  v  is  active  at  a  particular  point  in  the 
execution  of  the  algorithm  if  the  current  value  at  v  resulted  from  an  essential  path 
which  is  not  discarded. 

Algorithm  5.2.17 

i.  For  t  =  1, . . .  ,n,  initialize  the  values  at  and  tJ;  as  if  and  if,  respectively. 

ii.  For  k  =  1, . . . ,  2n  do  steps  iii-vii.  If  the  algorithm  did  not  terminate,  determine 
that  none  of  the  initial  values  is  strongly  infeasible,  and  stop. 

iii.  Perform  a  push  phase.  Denote  the  values  at  the  vertices  after  the  fc’th  push 
phase  by  i*,  Xj  (1  <  k  <  2 n). 

Keep  track  of  all  essential  push  operations  which  result  from  active  vertices.  If 
there  are  no  such  essential  pushes  or  no  active  vertices,  stop  and  determine  that 
none  of  the  initial  values  is  strongly  infeasible. 

(We  assume  that  ambiguities  about  the  edge  which  carried  the  essential  push 
are  resolved  consistently  according  to  some  ordering  on  the  edges.) 

iv.  If  fc  /  2}  for  all  integers  j  and  k  ^  2n,  go  to  step  iii  (next  iteration). 

v.  Optional:  If  for  some  j,  i*  >  i^,  then  stop. 

vi.  For  each  active  vertex  v,  let  p„  be  the  essential  path  that  corresponds  to  the 
last  update  of  the  value  at  v. 

If  pv  contains  a  cycle  (i.e.,  some  vertex  appears  in  more  than  one  pair),  execute 
step  vii.  After  considering  all  vertices,  go  to  step  iii  (next  iteration). 
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vii.  Consider  the  last  simple  cycle  c  on  pv.  Denote  by  u  the  vertex  where  the 
cycle  starts  and  ends.  Denote  by  £  the  value  associated  with  v.  at  its  first 
occurrence  on  the  cycle.  Check  whether  the  bound  implied  by  c  contradicts  f . 
If  there  is  no  contradiction,  £  is  consistent  (see  Proposition  5.2.16),  discard  all 
essential  paths  originating  from  the  first  pair  ( u,( )  of  the  cycle  (all  correspond 
to  consistent  values  according  to  Corollary  5.2.14);  proceed  to  consider  the 
next  vertex.  Otherwise,  if  c  contradicts  (,  (  is  strongly  infeasible.  It  follows 
from  Corollary  5.2.15  that  so  are  all  values  along  the  path  pv  prior  to  the  pair 
(u,fc(£))  (the  last  occurrence  of  u  on  pv).  In  particular,  the  initial  value  at  the 
first  vertex  w  of  pv  is  strongly  infeasible.  Stop. 

Complexity:  The  underlying  computation  of  the  algorithm  is  the  Bellman-Ford 
single-source  shortest  path  computation  [13],  where  we  maintain  information  needed 
to  construct  the  paths  and  test  for  cycles. 

The  algorithm  terminates  after  l  <  2n  iterations,  in  which  case,  it  requires  0(ml) 
sequential  time,  and  0(1)  time  using  0(m)  processors  on  a  CRCW  PRAM. 

If  the  algorithm  terminates  at  step  vii,  it  finds  a  closed  certificate  for  the  initial 
value  at  w.  If  the  algorithm  terminates  during  the  /’th  iteration,  the  closed  certificate 
found  is  of  size  0(£). 

Proposition  5.2.18  Consider  an  execution  of  Algorithm  5.2.17,  where  step  v  is 
skipped. 

i.  At  least  one  of  the  initial  values  is  strongly  infeasible  if  and  only  if  the  algorithm 
terminates  at  step  vii. 

ii.  Suppose  exactly  one  of  the  initial  values,  (  at  the  vertex  w,  is  strongly  infeasible. 
The  algorithm  terminates  at  step  vii  with  the  same  closed  certificate  regardless  of 
the  initial  values  at  other  vertices.  We  refer  to  this  certificate  as  the  certificate 

oft. 

Proof:  We  first  prove  part  i.  The  “if*  direction  is  immediate.  Suppose  an  initial 
value  £  at  a  vertex  w  has  a  closed  certificate  consisting  of  a  path  p  of  length  |pj  to  uo 
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and  a  cycle  c  of  length  r  which  starts  and  ends  at  u0.  After  at  most  ;p  iterations, 
the  value  at  u0  is  at  least  as  tight  as  /p(£)  and  hence,  has  a  closed  certificate 
consisting  of  c.  Similarly,  after  jp|  +  r  iterations  the  same  holds  for  all  the  vertices 
of  c.  Let  e0, . . . ,  er  and  u0, . . .  ,ur,  respectively,  be  the  edges  and  vertices  of  c.  We 
claim  that  all  iterations  for  which  k  >  £  result  in  an  update  of  at  least  one  of 
the  values  at  Uo, . . .  ,ur.  Assume  the  claim  is  true.  Suppose  the  algorithm  did  not 
terminate  during  any  of  the  2n  iterations.  It  follows  that  at  least  one  of  the  paths 
pUy  (1  <  j  <  r)  is  of  length  2n,  and  hence,  contains  a  cycle.  Since  the  value  at  Uj 
is  strongly  infeasible,  it  follows  from  Corollary  5.2.15  that  w  is  the  first  vertex  of 
pUj .  It  follows  from  Proposition  5.2.16  that  each  cycle  in  pu>  contradicts  the  value 
at  the  start  vertex. 

What  remains  is  to  prove  the  claim.  Assume  the  contrary.  Let  £0,  •  •  • ,  £r  be  the 
respective  values  at  u0,  •  •  •  ,«r-  In  particular,  none  of  the  edges  e*.  had  an  essential 
push.  Hence,  for  all  j  =  1,  is  not  tighter  than  £>+imodr.  It  follows 

that  /*,  . <,(&>)  =  fc(( o)  is  less  tight  than  (0-  This  is  a  contradiction  for  c  being  a 
closed  certificate  for  (0. 

We  prove  part  ii.  Consider  an  initialization  where  £  at  uj  is  the  only  strongly 
infeasible  value.  It  follows  from  part  i  that  the  algorithm  terminates  in  step  vii, 
and  finds  an  essential  path  p  which  starts  at  w  and  terminates  in  a  cycle.  All  values 
along  the  path  are  strongly  infeasible.  Suppose  that  an  initial  path  starting  at  a 
vertex  other  than  w  updates  a  value  at  a  vertex  in  p.  It  follows  from  Corollary  5.2.15 
that  the  updated  value  can  not  be  tighter  or  as  tight  as  the  value  resulting  from  p. 
The  proof  follows.  | 

Remark  5.2.19  Consider  a  run  of  the  algorithm  where  step  v  is  performed. 

i.  Suppose  that  the  algorithm  terminates  at  step  v,  where  it  detects  x*  >  Xj. 
Consider  the  initial  pairs  (u,  and  (w,fi)  of  the  two  essential  paths  which 
determined  (v.j,x*)  and  It  is  not  necessarily  true  that  one  of  the  ini¬ 

tial  pairs  is  infeasible.  We  can  conclude,  however,  that  the  two  corresponding 
bounds  can  not  be  satisfied  simultaneously  by  a  feasible  vector. 
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ii.  Consider  a  run  of  the  algorithm  where  at  least  one  of  the  initial  values  has 
an  open  certificate  of  size  L  which  contains  a  nontrivial  cycle.  The  following 
is  immediate:  the  algorithm  terminates  at  either  step  vii  or  step  v  within  L 
iterations. 


Locating  a  value:  We  apply  Algorithm  5.2.17  to  solve  Problem  5.2.12. 

Consider  a  value  £»•  We  show  how  to  decide  whether  or  not  &  >  xf“x  (the  case 
ii  <  x ?an  is  similar). 

Algorithm  5.2.20  [Locate  a  value  with  respect  to  xf***] 

Perform  a  run  of  Algorithm  5.2.17,  where  step  v  is  enabled,  for  the  following  input 
of  values: 

x?  =  aj  for  j  7^  i,  x*j  =  bj  for  j  =  1, . . . ,  n,  and  x?  =  &. 

Conclude  as  follows: 

•  If  the  algorithm  stopped  at  step  iii  (no  active  vertices),  determine  that  &  <  xf”“. 

•  If  the  algorithms  stopped  at  step  v,  consider  the  essential  paths  associated  with 
x*  >  x*.  If  in  neither  path,  the  initial  pair  is  (t^,  &),  conclude  that  the  system 
is  infeasible.  Otherwise,  conclude  that  £,  >  xf***.  ‘ 

•  Suppose  the  algorithm  terminated  at  step  vii.  If  w  =  t^,  determine  that  &  > 
xf“*.  Otherwise,  if  w  ^  the  system  is  infeasible. 

•  If  the  algorithm  terminated  at  step  ii,  determine  that  £,  <  xf“*. 

The  correctness  follows  immediately  from  Proposition  5.2.18  part  i,  and  Re¬ 
mark  5.2.19. 

Reveal  a  strongly  infeasible  value:  We  discuss  applying  Algorithm  5.2.17  to 
solve  the  following  problem. 
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Problem  5.2.21  [Reveal  a  strongly  infeasible  value! 

Given  are  values  £,,  a,  <  ii  <  b,  (i  e  I)  for  the  respective  variables  xt  (i  £  7),  where 
7  C  {1, . . .  ,n}.  Do  one  of  the  following: 

i.  Conclude  that  all  the  values  are  feasible  or  weakly  infeasible. 

ii.  Find  a  strongly  infeasible  value  (j  and  a  closed  certificate. 

Algorithm  5.2.22  [Reveal] 

Perform  a  run  of  Algorithm  5.2.17,  where  step  v  is  skipped,  for  the  input  values: 
Xi  =  Xi  -  &  (i  €  7),  and  x,  =  bt,  x,  =  a;  (t  £  7). 

The  following  is  a  corollary  of  Proposition  5.2.18: 

Corollary  5.2.23  Algorithm  5.2.22  solves  Problem  5.2.21.  If  none  of  the  initial 
values  is  strongly  infeasible,  the  algorithm  requires  0(mn)  operations.  Otherwise,  the 
algorithm  terminates  after  l  <  2n  iterations  with  a  closed  certificate  of  size  0(1).  If 
exactly  one  of  the  values  &  (i  €  I)  is  strongly  infeasible,  the  algorithm  terminates 
with  the  same  closed  certificate  regardless  of  the  other  values. 


5.3  The  basic  algorithm 

In  this  section  we  present  a  framework  for  solving  TVPI  systems.  This  framework 
allows  us  to  state  an  algorithm  for  TVPI  systems  in  terms  of  solving  instances  of 
Problem  5.2.12  (locating  single  values).  The  framework  is  stated  in  Subsection  5.3.1, 
and  the  correctness  proof  is  given  in  Subsection  5.3.2.  In  Subsection  5.3.3  we  reduce 
solving  a  TVPI  system  to  locating  0(n(logJ  n+log  m))  values.  Since  locating  a  single 
value  requires  0(mn)  time  (0(n)  time  in  parallel)  (see  Proposition  5.2.13),  we  obtain 
an  0(mnJ (log 2  n  +  log  to))  deterministic  algorithm  for  solving  TVPI  systems,  which 
runs  in  0(n3)  time  in  parallel. 

In  Subsection  5.3.4  we  introduce  the  problem  of  locating  a  pool  of  values.  A  key  for 
further  improvements  is  reducing  the  solution  of  a  TVPI  system  to  locating  0(logJ  n+ 
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log  m)  pools  of  values.  A  pool  of  values  can  be  located  naively  by  sequentially  locating 
n  single  values.  In  Section  5.4  we  present  faster  parallel  and  sequential  algorithms  for 
locating  a  pool,  which  yield  better  algorithms  for  solving  TVPI  systems. 

5.3.1  The  framework 

We  first  describe  an  idea  introduced  by  Megiddo  [46],  w'hich  is  the  key  in  obtain¬ 
ing  strongly  polynomial  time  bounds.  Consider  the  associated  graph  of  some  TVPI 
system.  Every  directed  path  in  the  associated  graph  corresponds  to  a  two- variable  in¬ 
equality.  Consider  two  inequalities  which  correspond  to  two  paths  between  the  same 
pair  of  vertices  (t?»,  vy),  where  V{  6  These  inequalities  axe  linear,  hence,  there 

exists  a  number  a  such  that  for  all  Zj  <  a  one  of  the  inequalities  implies  the  other, 
and  vice  versa  for  X{  >  a.  If  we  focus  only  on  feasible  points  x  for  which  z,  >  a 
(similarly  z,-  <  a),  one  of  the  inequalities  is  redundant. 

The  algorithm  eliminates  paths  and  simultaneously  restricts  the  feasible  region.  When 
“comparing”  two  paths,  the  decision  about  which  one  to  eliminate  is  done  as  follows. 
First,  the  number  a,  as  above,  is  computed.  The  redundant  path  is  determined  by 
locating  a  with  respect  to  feasible  values  of  z*  (see  Problem  5.2.12). 

Megiddo  applied  the  above  idea  in  an  algorithm  which  basically  performed  n  single- 
source  shortest  path  Bellman-Ford  type  computations,  where  comparisons  amount  to 
locating  values.  The  framework  presented  here  is  based  on  performing  an  all-pairs 
shortest  path  Floyd-Warshall  [13]  type  computation  which  allows  us  to  apply  further 
ideas. 

The  following  definition  formalizes  the  concept  of  comparing  paths. 

Definition  5.3.1 

i.  Suppose  that  pi  and  ps  are  two  directed  paths  from  u  to  v,  and  from  v  to  w , 
respectively.  Denote  by  pipj  the  path  from  u  to  w  obtained  by  concatenating 
Pi  and  p2> 
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ii.  Suppose  that  pi  and  P2  are  two  paths  from  u  to  w.  Let  7  C  R  be  an  interval. 
We  say  that  the  path  p\  is  at  least  as  tight  as  pi  relative  to  I  (denote  it  by 
Pi  -</  P2)  if  either  w  £  V_  and  fPl(£)  >  /Pj(0  for  all  (  €  7,  or  w  €  1'  and 
/,,«)  <  /„({)  for  all  f  S  /. 

iii.  Suppose  P\,Pi,  ■  •  •  ,Pk  are  paths  from  u  to  w.  If  for  some  x,  p<  -</  pj  for  all 
we  write  p^  =  min_<;{pi,p2,  •  ■  •  ,pjt}-  Note  that  when  7  is  a  single  point,  minX/ 
is  well-defined. 

The  algorithm  maintains  a  set  of  intervals  Si  (1  <  i  <  n),  and  a  path  puw  from  u 
to  w  for  every  pair  of  vertices  (u,  tu).  The  algorithm  runs  in  [log2  2n]  phases.  During 
each  phase,  the  paths  and  the  intervals  are  considered  for  possible  updates.  Denote 
by  5*  (1  <  i  <  n)  and  p*w  ({«,«;}  C  V)  the  intervals  and  paths,  respectively,  at  the 
beginning  of  the  k' th  phase.  The  algorithm  has  the  following  properties:  (i)  The  set 
X"=1  5*  contains  a  feasible  point,  and  (ii)  the  path  p*w  is  the  tightest  path  from  u  to 
w,  of  length  at  most  2fc,  relative  to  the  interval  S*  (where  u  6 

The  algorithm  is  based  on  solving  instances  of  the  following  problem: 

Problem  5.3.2  Given  are  a  graph  G  and  a  set  of  intervals  Si, . . . ,  Sn  as  in  Defini¬ 
tion  5.2.1.  For  every  ordered  pair  (u,  v)  €  V  x  V  of  vertices  we  are  given  . . . ,  , 

a  collection  of  directed  paths  from  u  to  v  in  G.  The  goal  is  to  find  a  set  of  n  intervals 
h ,  •  •  • ,  In  and  select  a  path  p~uv  €  {p]^ p£j* }  for  every  pair  (u,  v)  of  vertices,  where 

i.  F  56  0  =►  (X?=1  Ii)  H  F  ±  0,  and 

ii.  u  6  {a.-, t7i}  =>  p’w  =  min^pi,, . . . ,  jfr  }. 

We  later  suggest  algorithms  for  Problem  5.3.2.  The  following  algorithm  uses  it 
as  a  subroutine.  It  first  initializes  the  tightest-paths  matrix  by  selecting  the  tightest 
edge  out  of  every  set  of  multiple  edges  (step  ii).  The  rest  of  the  algorithm  consists  of 
flog  2n]  update  phases  of  the  tightest-paths  matrix  (step  iv).  During  each  phase,  the 
algorithm  considers  n3  sets  of  n  paths  (one  for  each  pair  of  vertices).  It  then  selects 
the  tightest  path  in  each  set.  In  the  last  step,  the  tightest  path  matrix  is  used  to 
compute  a  feasible  vector. 
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Algorithm  5.3.3  [Solve  TYPI  systems] 

i.  [Initialization]  Construct  5  and  G  as  in  Definition  5.2.1. 

ii.  [Initialize  tightest  paths  matrix]  For  each  pair  of  vertices  consider  the  set  of 
paths  of  length  1,  i.e.,  all  the  multiple  edges.  Solve  Problem  5.3.2  relative  to 
these  paths. 

For  (u,  v)  6  V  x  V:  p°uv  <-  p’uv.  For  *  =  1, . . . ,  n:  S{  «-  Si  n  /*,  S°  «-  Si. 

iii.  For  k  =  1, . . . ,  [log2  2n] ,  execute  step  iv.  To  continue,  go  to  step  v. 

iv.  For  each  pair  (u,  v)  £  V  x  V,  consider  the  set  of  paths 

{Pt?}  u  €  V  \  . 

Solve  Problem  5.3.2  relative  to  these  sets  of  paths. 

For  (u,v)  G  V  x  V:  <—  p'„.  For  i  =  1, . . .  ,n:  Si*- Si  D  7i}  Sf  <-  Si. 

v.  Denote  Pil  =  ,<***,  Pjj  .  p^2'1,  m  m  Pij  «  pD^2”1  (for  1  <  <  <  «. 

1  <  j  <  n). 

For  i  =  1, . . .  ,n: 

$  <-  Si n  {*!/?«(*)  <  *},  S,-  4-  5,-  n  {*|/p  (x)  >  x},  Si  *-  Si  n  {xlf^Xx)  <  x}. 


vi.  For  x  =  1, . . .  ,n,  compute  intervals  5,'  =  [o',  fc']  as  follows: 


a{  *—  max 


joi,  m^ax  f^Xbj),  max  /^(o,)} 


K  “in  {K  nun  f^bj),  mjn  /^(a,)  J 

vii.  Compute  a  feasible  solution  x  as  follows,  for  i  =  1, . . .  ,n: 

Xi  €  max  {o', max  /^(x,),  max  /^(x,)}  , min  {tj,  nun  f^Zj),  rmn  /,.;(®,)}] 
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5.3.2  Correctness 

We  prove  the  following: 

i.  For  1  <  *  <  n,  S°  D  S}  D  •  •  •  3  S/log22n'  D  S'.  . 

ii.  If  X”=a  Sf  contains  feasible  points,  then  so  do  X”=1  S-  (for  all  k  >  0)  and  X”=1  S'. 

Put*  (u  €  u;  €  V)  is  the  tightest  path  (relative  to  S-')  from  u  to  w,  of 

length  less  than  or  equal  to  2*. 

iv.  If  the  system  is  feasible,  the  process  described  in  step  vii  results  in  a  feasible 
vector  x. 

Claims  i  and  ii  follow  directly  from  the  statement  of  the  algorithm. 

We  prove  Claim  iii.  Consider  paths  p\ ,  p2  between  the  same  pair  of  vertices. 
Assume  they  originate  at  It  follows  from  the  definition  of  -<  that  if  Pl  -<  p2 

relative  to  S* ,  then  pi  -<  p2  relative  to  S*+1.  Consider  paths  of  length  at  most 
2fc+1  from  u  to  lt  to  show  that  when  puv  =  puv>Pwv  and  p'uv  =  p'^p'^,  if 

Puv>  ~<j  P'uw  and  Pmv  -</  p^fV)  then  puv  -</  p'uv.  The  latter  is  straightforward. 

We  prove  claim  iv.  We  first  show  that  S'  C  [afaiB,  **■“]  ( i  =  1, . . .  ,n).  We  claim 
that  Sj  ^  C  [x.j,  x-\.  If  the  latter  holds,  it  follows  from  Proposition  5.2.9  that 
S[  C  [if"1,  *n-  Assume  the  contrary,  that  is,  for  some  1  <  j  <  n,  Sjlog2nl  £  [*’,  x}]. 
Consider  a  point  q  €  Sj  g  ^  \  [x^,5^].  Assume  that  q  >  (the  case  where  q  <  xj  is 
similar).  It  follows  that  there  exists  a  simple  cycle  c,  such  that  either  of  the  following 
is  true: 

i.  The  cycle  c,  starts  and  ends  at  and  is  such  that  fe(q)  <  q. 

ii.  The  cycle  c,  starts  at  Vj  and  ends  at  vj,  and  is  such  that  fe(q)  <  q. 

Denote  c  =  Pvjvi  (for  case  (i))  and  d  =  p^.  (for  case  (ii)).  It  follows  from  claim  iii 
that  the  cycle  d  is  the  tightest  simple  cycle  relative  to  sj10*2"1.  Therefore,  fe(q)  < 
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fc{v)-  On  the  other  hand,  due  to  step  v  of  the  algorithm  fc'(v)  >  V-  This  is  a 
contradiction.  We  had  shown  that  St-  C  lx™11,  ( i  =  1, ....  n). 


We  conclude  the  proof  of  Claim  iv.  Consider  the  computation  performed  in 
step  vii.  For  i  =  l,...,n  consider  the  set  of  intervals  S'{  (1  <  l  <  n)  defined  as 
follows.  If  l  <  i,  S\  —  {x/}.  Otherwise, 


5*  = 


max 


|ai,ma^/P7^(ii),max/PjJ(xj)J  ,min  {  fcj ,  min  /p_(x , ) , mm  fp.{ij ) j j 


We  claim  that  for  i  =  the  following  holds.  For  each  p  £  5‘-  there  exist  a 

feasible  vector  £  such  that  (j  =  p  and  (t  £  S}  (£  ^  j).  To  conclude  the  proof  of 
property  iv,  it  suffices  to  prove  the  claim.  We  prove  the  claim  by  induction  on  i. 

In  the  base  case  5]  =  S't  (1  <  £  <  n).  We  show  that  for  every  ^  €  Sj-  there  exists  a 
feasible  solution  x  such  that  Xj  =  £ j  and  xi  £  S't  for  t  ^  j.  Assume  the  contrary.  It 
follows  from  Proposition  5.2.11  that  there  exist  two  bounds  £  {xj  >  £j )  Xi  —  £j} 
and  s2  €  {xi  <  b't,xi  >  a*}  such  +hat  th'*  system  subject  to  x*  £  S'k  (1  <  k  <  n) 
with  the  additional  bounds  Si,s2  is  infeasible.  Assume  that  sx  is  Xj  >  (j  and  s2  is 
xi  <  b\  (the  other  cases  are  treated  similarly).  It  follows  from  Corollary  5.2.10  that 
there  exists  a  simple  yatL  p  from  vi  to  Vj  such  that  fp[b'i)  <  £ j .  In  contrast,  it  follows 
from  property  iii  and  the  computation  of  step  vi  that  fp'{b't)  >  b'-  for  all  paths  p'  from 
v/  to  Vj,  hence  a  contradiction. 

We  prove  the  correctness  :  f  the  induction  step.  Consider  the  step  which  determines  x ;. 
Assume  that  the  claim  is  true  for  previous  steps.  The  induction  hypothesis  asserts 
that  subject  to  the  constraints  i/  £  Sj  (1  <  f  <  n),  we  have  [xj'"”,  x^“]  =  S}  = 
[a),  b\}.  For  1  <  £  <  n,  let  xj11111,  x““  be  the  respective  values  of  x^“,  if**  subject  to 
Xi  €  <5/  (1  <  l  <  n)  and  the  additional  constraint  x,-  =  Xj.  We  need  to  show  that  for 
1  <  t  <  n>  [xj1111,  xj“*]  =  S\+1.  The  direction  5)+1  D  [xf1111,  x^“]  is  obvious.  Consider 
if**  (the  arguments  for  x^  are  similar).  If  x!-“x  =  6)  we  are  done.  Otherwise,  it 
follows  from  Proposition  5.2.9  that  there  exist  a  simple  path  p  from  Vi  £  {v«>  3^}  to 
Vj  such  that  fp(ii)  —  x1?**.  It  follows  from  property  iii  that  either  p'  =  or  p'  =  p,j 
is  as  tight  as  p.  Hence,  fp<{xi )  =  if*  =  fc‘+1 . 


Remark  5.3.4  Note  that  step  vii  can  be  replaced  by  choosing  =  «  +  K)/2, 
!<*<».  This  follows  from  the  fact  that  for  the  system  subject  to  Xj  €  S- 
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1  <  t  <  n,  we  have  [a:?nm,  x™BX]  =  S'  (1  <  i  <  n ),  and  from  Proposition  5.2.11  part  ii. 

5.3.3  Complexity  of  the  naive  implementation 

The  complexity  is  dominated  by  the  calls  to  an  algorithm  for  Problem  5.3.2.  We 
present  a  naive  algorithm  for  the  problem,  which  is  based  on  locating  single  values 
(see  Problem  5.2.12).  The  algorithm  consists  of  n  sequential  stages  as  follows.  At  the 
i’th  stage  the  interval  L  is  computed  and  a  tightest  path  is  found  for  each  of  the  0(n ) 
sets  of  paths  which  emanate  from  either  one  of  In  the  proceeding  stages  we 

consider  the  system  with  the  additional  constraint  a,  £ 

We  discuss  stage  i.  We  consider  O(n)  sets  of  paths  with  the  goal  of  choosing  a 
tightest  path  in  each  set.  This  is  done  in  0(log  k^)  iterations,  where  in  each  iteration 
the  total  number  of  paths  which  need  to  be  considered  reduces  from  r  to  n+3(r — n) /4. 
Initially,  r  =  After  the  stage  terminates,  r  =  n  and  each  set  contains  a  single 
path  (the  tightest  path).  Each  iteration  is  as  follows.  First,  we  pair  up  paths  which 
belong  to  the  same  pair  of  vertices.  Each  such  pair  corresponds  to  a  comparison 
between  the  two  paths  that  needs  to  be  resolved.  For  each  pair,  we  compute  the 
intersection  of  the  two  linear  functions  which  correspond  to  the  two  paths.  Each 
“comparison”  amounts  to  locating  the  intersection  point  with  respect  to  the  interval 
*?*“].  We  solve  one  instance  of  Problem  5.2.12  to  locate  the  median  of  these 
r/2  intersections.  By  doing  this,  half  the  comparisons  are  resolved,  and  the  number 
of  remaining  paths  is  at  most  n  -f  3 (r  —  n)/4. 

The  following  proposition  is  immediate. 

Proposition  5.3.5  Stage  i  can  be  performed  by  an  0(k W  +  mn log  k^)  algorithm, 
where  k W  =  +  fcu«)  fu  =  Vi).  A  parallel  algorithm  runs  in  0(n  log  k^)  time 

using  0(m  -f  k^)  processors  on  a  CRCW  PRAM. 

We  discuss  the  resulting  complexity  of  Algorithm  5.3.3.  In  step  ii,  the  total  number 
of  paths  (£"=1  k^)  is  the  number  of  edges  in  the  graph.  Hence  the  number  of  opera¬ 
tions  is  0(mn2  logm).  In  step  iv,  k^  =  n2  (1  <  t  <  n).  Hence,  the  number  of  opera¬ 
tions  in  each  execution  of  step  iv  is  0(ns  +  mn 2  log  n).  Step  iv  is  performed  flog2  2n"| 
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times.  It  follows  that  the  total  number  of  operations  is  O  (mn2(logm  —  log2n)j.  On 
a  CRCW  PRAM  the  algorithm  runs  in  0  (n2(logm  —  log2n)j  time,  using  0(m  —  n2) 
processors. 

5.3.4  Solving  TVPI  systems  by  locating  pools  of  values 

Consider  the  “naive”  algorithm  for  solving  Problem  5.3.2.  The  algorithm  consists  of 
n  stages  which  are  performed  sequentially.  We  present  a  different  algorithm  where 
the  stages  are  performed  “concurrently”  with  interleaving  iterations:  the  algorithm 
consists  of  0(maxi<j<n  log  k M)  phases,  where  phase  t  comprises  the  i’th  iterations  at 
each  of  the  n  stages. 

An  iteration  of  stage  1  <  j  <  n  amounts  to  locating  a  value  of  Xj.  Performing  a 
phase  amounts  to  solving  an  instance  of  the  following  problem: 

Problem  5.3.6  [Locate  a  pool  of  values]  Given  are  a  graph  G  and  a  set  of  intervals 
Si , . . . ,  S„  as  in  Definition  5.2.1.  Also  given  are  a  set  of  values  fa  (t  £  I)  for  the 
corresponding  variables  Xi  ( i  €  /),  where  I  C  {1, . . .  ,n).  The  goal  is  to  find  a  set  of 
intervals  J;  (t  €  I),  such  that  (i)  F  ^  0  =»  F  D  {*  €  JT*|  Aier  *»  €  J%}  ^  0,  and  (ii) 
&  interior  Ji  (t  €  /)• 

We  give  a  more  elaborate  description  of  the  algorithm  which  reduces  Problem  5.3.2 
to  locating  pools  of  values.  The  correctness  is  straightforward.  The  algorithms  per¬ 
forms  interleaving  executions  of  the  “n”  stages.  This  is  done  in  O(log  max*  k^) 
phases.  Denote  by  I  C  {1, . . .  ,n}  (|J|  =  t)  the  set  of  stages  which  did  not  terminate 
at  the  current  phase  (initially,  I  =  {1, . . .  ,n}).  Phase  j  is  as  follows: 

i.  For  each  l  €  I:  compute  the  value  (i  of  z/  arising  from  an  iteration  of  stage  l. 

This  computation  requires  0(n+(3/4)J  time,  and  requires  logarithmic 

time  in  parallel  with  optimal  speedup. 

ii.  Locate  the  pool  (solve  Problem  5.3.6)  fa  ( l  €  I). 

iii.  For  l  £  /,  Si  St  H  J/. 
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It  follows  that  Problem  5.3.2  is  reduced  to  solving  0(max,-  log  k^)  instances  of 
Problem  5.3.6  and  0(n  max;  log  +  Ei<i<n  additional  time.  The  problem  of 
solving  TVPI  systems  is  therefore  reduced  to  solving  0(log2n  4-  logm)  instances  of 
Problem  5.3.6  and  0(m  -f  n3logn)  additional  computation.  Note  that,  in  parallel, 
the  additional  computation  can  be  done  in  logarithmic  time  with  optimal  speedup. 

Problem  5.3.6  can  be  solved  naively  by  sequentially  solving  |/j  instances  of  Prob¬ 
lem  5.2.12  (locating  a  single  value).  This  requires  mn2  time,  and  0(n2)  time  in 
parallel.  In  Section  5.4  we  give  algorithms  which  improve  over  this  bound. 


5.4  Algorithms  for  locating  a  pool 

The  problem  of  solving  TVPI  systems  was  reduced  to  locating  0(log2  n  -(-log  m)  pools 
(Problem  5.3.6)  and  0(n3  log  n  +  m )  additional  computation. 

We  had  shown  that  a  pool  can  be  located  in  mn2  time,  and  0(n2)  time  in  parallel. 
In  this  section  we  discuss  two  algorithms  for  locating  a  pool.  In  Subsection  5.3.6  we 
present  an  algorithm  which  runs  in  0(mn2)  sequential  time,  and  0(n)  time  in  parallel 
with  optimal  speedup.  In  Subsection  5.4.2  we  overview  a  randomized  0(mn log3  n) 
expected  time  algorithm.  A  parallel  implementation  runs  in  0(n)  expected  time  using 
0(m)  processors.  The  details  of  the  randomized  algorithm  are  given  in  Sections  5.5 
and  5.6. 


5.4.1  0(n)  time  using  0(nm)  processors 

In  this  subsection  we  prove  the  following. 

Proposition  5.4.1  Problem  5.S.6  can  be  solved  on  a  CRCW  PRAM  in  0(n )  time 
using  0(m|7|)  processors. 

Corollary  5.4.2  Algorithm  5.S.S  has  a  parallel  implementation  on  a  CRCW  PRAM 
which  runs  in  0  (n(log2  n  -I-  log  mj)  time  and  uses  0(mn)  processors. 
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The  following  algorithm  solves  Problem  5.3.6  within  the  time  bounds  stated  in 
Proposition  5.4.1. 

Algorithm  5.4.3  [locate  a  pool] 

i.  Locate,  in  parallel,  the  values  &  ( z  €  /). 

For  z  6  /  do  as  follows: 

•  If  £,  >  xf“x,  determine  Ji  <—  { z\z  <  £,}. 

•  If  (i  <  ®?un,  determine  Ji  *—  {z\z  >  &}. 

Let  V  C  1  be  the  set  of  indices  such  that  xf““  <  &  <  xf***  (t  €  /'). 

ii.  For  each  z  €  I'  perform  the  following  computation: 

Initialize  the  values  at  the  vertices  of  G  to  x,-  =  x±  =  and  Xj  =  oo,  Xj  =  —  oo 
(j  ^  »’).  Apply  2 n  push  phases.  For  1  <  j  <  n  denote  the  final  values  at  the 
nodes  Vj,  z/j,  respectively,  by  x‘,  xt. 

iii.  Construct  a  graph  H  as  follows:  The  graph  has  j/'l  nodes  (z  €  /')•  There 
is  an  edge  between  zn*  and  Wj  if  and  only  if  (j  <  x*  or  ^  >  x}. 

iv.  Compute  a  maximal  independent  set  (corresponds  to  nodes  I+  C  I')  in  H. 
Choose  intervals  Ji  (z  €  /')  as  follows: 

Ifz€/+, 

If  z  €  /'  \  /+,  then: 

•  If  mini€/+  2*  <  &,  J,-  <-  {r|*  <  &}. 

•  If  max>€/+  xf  >  &,  Ji  <-  {z|z  >  &}. 

We  prove  the  correctness  of  the  algorithm.  It  follows  from  Corollary  5.2.10  that 
F  fl  {x|xj  =  (i  A  Xj  =  £,}  ^  0  if  and  only  if  x’-  <  ( j  <  2^.  Hence,  the  graph  H 
captures  the  dependencies  between  pairs  of  values.  Proposition  5.2.11  (see  equivalence 
of  properties  i  and  iv)  implies  that  a  set  of  single  variable  equations  is  feasible  if  and 

only  if  every  pair  is  feasible.  Hence,  the  set  of  intervals  Ji  (t  €  I')  solves  Problem  5.3.6. 
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5.4.2  Overview  of  a  O(ran)  algorithm 

In  this  subsection  we  present  an  overview  of  a  faster  randomized  algorithm  for  locating 
a  pool  of  values.  We  prove  the  following: 

Theorem  5.4.4  Problem.  5.3.6  can  be  solved  (i)  sequentially,  in  an  expected  number 
of  0(mn  log3  n)  operations,  and  (ii)  on  a  CRCW  PRAM,  in  0(n  log3  n)  expected 
time,  using  0(m )  processors. 

Consequently,  TVPI  systems  can  be  solved  in  an  expected  number  of 
0  (n3  log  n  4-  mn(log5  n  +  log  m  log3  n)) 

operations. 

A  single  value  can  be  located  in  0(mn)  steps  by  using  Algorithm  5.2.20.  In  the 
previous  section  we  solved  Problem  5.3.2  more  efficiently  by  inferring  from  locating 
a  value  of  one  variable  about  other  values  of  the  same  variable.  The  randomized 
approach  presented  in  this  section  enables  us  to  infer  about  values  of  other  variables 
as  well. 

Consider  a  pool  of  values  £i, . . .  for  the  corresponding  variables.  These  values 
are  classified  into  3  groups  as  follows: 

i.  (i  €  [*$““,  *f“]  (&  is  feasible) 

ii.  &  &  [*f“,  *?*“],  but  (i  €  [*?*-,*?«-]  (£<  is  weakly  infeasible) 

iii.  &  £  jg  strongly  infeasible) 

We  suggest  a  two  stage  algorithm  for  Problem  5.3.6.  In  the  first  stage  the  algo¬ 
rithm  locates  all  the  strongly  infeasible  values  ( ,•  (»  €  I1),  and  determines  whether 
(i  <  xf"®'  or  >  x?**’.  The  respective  intervals  are  determined  to  be  J,  =  [— oc,&] 
if  &  >  xf**",  and  =  [&,  oo]  if  &  <  xf“\  In  the  second  stage  the  algorithm  solves 
an  easier  special  case  of  Problem  5.3.6  where  the  values  are  guaranteed  to  be  either 
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feasible  or  weakly  infeasible.  In  Section  5.5  we  present  an  algorithm  which  solves  the 
first  stage  in  0(mn  log3  n)  time,  and  in  0(nlog3n)  time  using  0(m)  processors  on 
a  CRCW  PRAM.  In  Section  5.6  we  present  an  algorithm  for  the  second  stage  which 
computes  a  solution  in  time  0(mn  log2  n),  and  in  0(nlog2  n)  parallel  time  using  0(m ) 
processors  on  a  CRCW  PRAM. 

From  combining  the  above  results  we  get  algorithms  for  Problem  5.3.6  with  the 
running  times  stated  in  Theorem  5.4.4. 


5.5  Locating  the  strongly  infeasible  values 

We  present  an  algorithm  for  the  following  problem: 

Problem  5.5.1  [Determine  the  strongly  infeasible  values]  Given  are  values  &  (*  €  I) 
for  the  corresponding  variables  x{  (t  €  I),  where  I  C  {l,...,n}.  The  goal  is  to 
determine  for  each  i  €  1  whether  &  <  i"*',  £,  >  or  &  6 

We  prove  the  following: 

Proposition  5.5.2  Problem  5.5.1  can  be  solved  (i)  sequentially,  in  an  expected  num¬ 
ber  of  0(mn  log3  n )  operations,  and  (ii)  on  a  CRCW  PRAM,  in  0(n  log3  n)  expected 
time,  using  O(m)  processors. 

For  purposes  of  analysis  we  classify  the  strongly  infeasible  values  of  variables 
according  to  the  sizes  of  their  certificates: 

Definition  5.5.3  A  strongly  infeasible  value  (i  of  a  variable  x±  is  l-big  if  the  cer¬ 
tificate  is  of  size  at  most  l.  A  value  is  (/l^jj-big  if  it  is  /2-big  but  not  /j-big.  We 
interchangeably  refer  to  strongly  infeasible  values  as  big.  Note  that  all  strongly  in¬ 
feasible  values  are  2n-big. 

We  explain  the  motivation  for  this  classification.  The  algorithm  which  determines 
all  the  big  values  is  based  on  a  tradeoff  between  the  following  two  properties.  These 


116 


CHAPTER  5.  SYSTEMS  WITH  TWO  VARIABLES  PER  INEQUALITY 


properties  are  stated  more  formally  and  proved  later.  The  first  one  “favors’1  big  values 
with  small  certificates:  For  any  /-big  value  £  it  takes  0(mt )  operations  to  find  the 
certificate.  Hence,  a  decision  procedure  which  locates  an  /-big  value  requires  only 
0(m£)  operations.  The  second  property  favors  values  with  large  certificates:  We 
introduce  the  procedure  “check”  that  considers  a  big  value  £  along  with  an  associated 
certificate  E' .  The  “check”  procedure  can  on  average  locate  many  other  big  values 
whose  certificates  intersect  E' . 


5.5.1  The  algorithm 

The  description  of  the  algorithm  includes  calls  to  the  following  three  procedures: 

•  The  first  procedure  “reveal”  (Problem  5.2.21)  considers  a  set  of  k  values,  and 
either  finds  a  big  value  which  belongs  to  the  set,  or  concludes  that  all  these 
values  are  feasible  or  weakly  infeasible.  Algorithm  5.2.22  solves  “reveal”.  Recall 
(see  Proposition  5.2.23)  that  the  algorithm  takes  0(mn)  time.  If  the  input  set 
contains  big  values,  the  algorithm  terminates  after  0(ml)  operations  (where 
/  <  n)  with  a  closed  certificate  of  size  6(1).  If  the  original  set  contained  exactly 
one  big  value  (j,  the  certificate  found  by  the  algorithm  is  the  certificate  of  (j. 

•  The  second  procedure  “check”  considers  an  /-big  value  and  the  corresponding 
certificate.  The  “check”  procedure  uses  the  certificate  to  produce  upper  and 
lower  bounds  on  the  feasible  regions  of  other  variables.  This  gives  rise  to  de¬ 
cisions  regarding  other  values  in  the  pool.  An  0(ml )  time  “check”  algorithm 
is  given  in  Subsection  5.5.3.  In  Subsection  5.5.4  we  discuss  properties  of  the 
“check”  algorithm.  We  show  that  at  least  half  of  the  (/,  2/]-big  values  in  the 
pool  have  the  following  property:  When  a  “check”  is  applied  to  any  of  them,  it 
determines  at  least  //(6n)  of  the  other  (/,  2/]-big  values  in  the  pool. 

•  The  third  procedure  considers  the  set  of  unlocated  values  I  and  computes  a 
crude  estimate  r*  to  the  number  of  big  values  r.  The  estimate  r*  is  such  that 
1/2  <  r'/r  <  2  with  probability  at  least  1/2.  In  Subsection  5.5.5  we  present 
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an  “estimate”  algorithm  which  performs  0((log  log  n)2)  calls  to  the  “reveal” 
procedure.  The  time  complexity  is  dominated  by  these  calls. 

The  following  algorithm  determines  all  big  values  in  a  pool  of  j/|  =  n  values. 
The  algorithm  performs  iterations  where  each  iteration  determines  some  of  the  big 
values.  The  set  I  and  the  number  h  are  updated  accordingly.  Denote  by  r  <  h  be 
the  (unknown)  number  of  big  values  in  1.  Appropriate  values  for  the  constants  Ci , 
C2,  and  C  are  given  later. 

Algorithm  5.5.4  [Determine  all  the  big  values] 

o  Loop  A:  Steps  i-ix 

i.  Apply  the  “reveal”  procedure  to  the  set  I.  If  there  are  no  big  values,  stop. 

ii.  Compute  an  estimate  r"  for  r. 

Reset  the  following  two  counters:  it  0  [number  of  operations]; 

tb  *-  0  [number  of  big  values  discarded]  (in  the  current  iteration  of  Loop  A). 

o  Loop  B:  Steps  iii-ix 

iii.  If  either  (i)  tb  >  r"  f 4  (successful  iteration  of  Loop  A),  or  (ii)  tt  >  Cmn  log2  n 
(unsuccessful  iteration  of  Loop  A),  go  to  Step  i  (next  iteration). 

iv.  b  *—  0  [the  number  of  big  values  discarded  in  the  current  iteration  of  Loop  B]. 

v.  If  r’  >  n/4,  s  *—  1.  Otherwise,  s  <—  |n/r*J  •  Choose  k  =  \C\  log  n]  random 
samples  Si, . . . ,  5*  of  size  s  (with  returns)  from  the  pool  I. 

vi.  Execute  “concurrently”  k  runs  of  “reveal”  (see  Algorithm  5.2.22)  applied  to  the 
sets  Si, . . . ,  Sk  as  follows: 

vii.  Initialize  copy  t  according  to  the  set  of  values  Sj.  Set  t  <—  0  [current  phase 

number].  Set  K  *—  k}  [set  of  “active”  rims]. 


o 


Loop  C:  Steps  viii-ix 
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viii.  Perform  an  additional  iteration  (push  phase)  to  the  runs  in  K .  set  L  —  l  —  1. 
Let  K'  C  K  be  the  possibly  empty  subset  of  the  runs,  for  which  certificates  are 
found.  Apply  “check”  operations  to  these  certificates,  and  discard  from  I  all 
the  big  values  which  are  determined.  Increment  b  and  tb  accordingly,  and  set 

K  <-  K\K'. 

ix.  If  either  l  =  2n  +  1  or  b  >  \C^lr' /[n\og  n)]  then  tt  <—  tt  +  C\lm\ogn,  go  to 
Step  iii  [next  iteration  of  Loop  Bj.  Otherwise,  go  to  Step  viii. 

In  Subsection  5.5.2  we  prove  the  following: 

Proposition  5.5.5  There  exist  constants  C\,Ci,  and  C  as  follows.  If  the  estimate 
r~  is  such  that  1/2  <  r”/r  <  2,  then  with  probability  1/2,  the  current  iteration  of 
Loop  A  terminates  after  determining  r"  / 4  big  values  (the  iteration  is  successful) . 

It  follows  that  with  probability  1/2,  each  iteration  of  Loop  A  determines  at  least 
1  /8  of  the  big  values  in  the  pool.  Hence,  the  expected  number  of  iterations  performed 
until  ail  big  values  are  determined  is  at  most  6  log  n.  Each  iteration  of  Loop  A  runs 
in  0(mn  log2  n)  time.  Hence,  the  expected  time  in  which  Algorithm  5.5.4  terminates 
is  0(mn  log3  n).  This  concludes  the  proof  of  Proposition  5.5.2. 

5.5.2  Probability  for  a  successful  iteration  of  Loop  A 

In  this  subsection  we  prove  Proposition  5.5.5. 

Denote  by  fj  (t  =  1, . . . ,  [log  n] )  the  number  of  (2*-1 , 2’]-big  values  in  7. 

Definition  5.5.6  A  sample  S  C  I  of  values  from  the  pool  is  rood  if: 

i.  The  sample  contains  exactly  one  big  value  Denote  the  size  of  the  certificate 
of  (j  by  2-1  <t<2\ 


ii.  determines  (using  “check”)  at  least  2,-1rj/(6n)  big  values  in  I. 
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iii.  rt  >  r/(21og  n). 


Note  that  since  r  =  r«'i  property  (iii)  must  hold  for  at  least  3/4  of  the 

big  values.  It  follows  from  the  analysis  done  in  Subsection  5.5.4  (for  the  ‘check” 
procedure)  that  property  (ii)  holds  for  half  the  (2‘-1, 2‘j-big  values  (for  all  i).  Hence, 
properties  (ii)  and  (iii)  hold  for  3/8  of  the  big  values. 

The  following  proposition  motivates  the  definition  of  good  samples: 

Proposition  5.5.7  Consider  an  application  of  the  reveal  algorithm  to  a  good  sample, 
followed  by  a  check  to  the  certificate  found.  There  exist  a  number  £  =  2*  (l  <  i  < 
[logn]/,  such  that  by  using  0(m£)  operations  we  can  locate  at  least  £r / (12nlog  n)-f  1 
big  values. 

Proof:  Denote  by  (j  the  big  value  in  the  sample,  and  by  2‘-1  <  l  <  2*  the  size  of 
the  certificate  of  £j.  Consider  an  application  of  the  reveal  algorithm  to  the  sample. 
It  follows  from  Proposition  5.2.18  part  ii  that  the  certificate  is  found  using  0(m£) 
operations.  A  check  procedure  applied  to  the  certificate  locates  at  least  2,_1ri/(6n) 
additional  big  values  from  the  pool.  Note  that  >  r/(21ogn),  hence  at  least 
lr/(12nlog  n)  +  1  big  values  are  located,  i 

Proposition  5.5.8  Consider  a  number  s  such  that  (i)  s  —  1  if  r  >  n/2  and  some¬ 
times  when  r  >  n/8,  and  (ii)  otherwise,  c  =  rs/h  is  such  that  0.5  <  c  <  2.  Consider 
a  random  sample  S  C  /  of  s  elements  (with  returns).  There  exists  a  constant  px  such 
that  S  contains  exactly  one  big  value  with  probability  at  least  px . 

Proof:  The  probability  that  5  contains  exactly  one  big  value  is 


Assume  that  s  =  1.  It  follows  that  8 r  >  h  and  hence  P  >  1/8.  Otherwise,  2r  <  n 
and  P  >  c/4e  >  1/32.  a 
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Corollary  5.5.9  Suppose  that  1/2  <  r’/r  <  2,  and  consider  a  random  sample  S  C  I 
(with  returns)  of  size  s,  where  5  =  1  when  r*  >  n/4  and  s  =  j_n/r"J  otherwise.  The 
sample  S  is  good  with  probability  p  >  3pi/8. 

Corollary  5.5.10  Let  C\  =  —  1/ log(  1  —  p).  Consider  [Cilogn]  randomly  chosen 
samples  ( with  returns)  of  size  s.  If  1/2  <  r’/r  <  2,  the  probability  that  none  of  the 
samples  is  good  is  smaller  than  1/n. 

Proof:  The  probability  that  no  sample  is  good  is  p  =  (1  —  p)'Cl  logT^ .  It  follows 
that  p  <  1/n  when  C\  >  — l/log(l  —  p).  t 

Assume  that  1/2  <  r’/r  <  2,  and  choose  Cz  =  1/12.  We  compute  the  expected 
number  of  steps  performed  until  r~  / 4  big  values  are  located. 

Definition  5.5.11  An  iteration  of  Loop  B  is  beneficial  if  b  >  \Cztr~ /(nlog  nf]  when 
the  iteration  terminates  (at  step  ix). 

A  beneficial  iteration  requires  0(Cim£logn)  operations,  and  each  unbeneficial  itera¬ 
tion  requires  0(Ci iron  log  n)  operations. 

It  follows  from  Proposition  5.5.7  and  Corollary  5.5.10  that  (n  —  l)/n  of  all  iterations 
of  Loop  B  are  beneficial.  Hence,  the  expected  number  of  operations  performed  by  un¬ 
beneficial  iterations  is  not  bigger  than  the  expected  number  of  operations  performed 
by  beneficial  iterations. 

It  follows  that  it  suffices  to  count  the  expected  number  of  operations  performed 
by  beneficial  iterations  until  the  number  of  big  values  is  reduced  by  a  half  (hence,  at 
least  r*/4  big  values  are  located). 

Denote  by  U  ( i  =  1,...,  flogn])  the  number  of  beneficial  iterations  which  ended 
within  (2‘-1, 2*]  phases.  The  total  number  of  steps  performed  by  beneficial  iterations 
is  as  =  C\  2,^1 t<2‘mlogn.  By  definition,  half  the  big  values  are  eliminated  when 
nf=°*nl  {\  —  Cz2x~l/{n  log  n))^  =  1/2.  By  applying  a  log  operation  to  both  sides,  it  fol¬ 
lows  that  (tt-C22‘~1  /(nlog  n))  <  2.  The  total  number  of  operations  of  beneficial 

iteration  until  half  the  big  values  are  decided  is  therefore  as  <  /C^mn log2  n.  The 
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expected  number  of  operations  in  beneficial  and  unbeneficial  operations  is  at  most 
ss  <  I6C1  jCimn log2  n.  We  choose  C  =  2OC1/C2.  It  follows  that  with  probability 
1/2,  at  least  half  of  the  big  values  are  located  before  the  current  iteration  of  Loop  A 
performs  Cmnlog2  n  operations.  This  concludes  the  proof  of  proposition  5.5.5. 


5.5.3  The  “check”  procedure 

In  this  subsection  we  give  an  algorithm  for  performing  “checks.”  The  “check”  al¬ 
gorithm  is  applied  to  a  (2*-1, 2*]-big  value  &  and  the  certificate  &,  and  provides 
information  from  which  it  can  be  concluded  that  certain  other  values  are  also  big. 

We  explain  how  a  closed  certificate  for  a  value  of  one  variable  is  used  to  obtain 
information  about  other  variables: 

Remark  5.5.12  Suppose  &  is  a  big  value  of  Xj.  The  certificate  Gi  consists  of  a  path 
from  Vi  to  a  vertex  u  and  a  cycle  which  starts  and  ends  at  u,  such  that  V(  =  vt  if 
>  x“,x‘,  and  Vi  =  if  &  <  if””".  For  a  vertex  w  in  Gi,  define  &( w)  =  /p(&),  where 
p  is  the  path  in  Gi  from  v,  to  w.  Define  (i(vi)  =  It  follows  from  Corollary  5.2.15 
that  for  every  vertex  w  of  Gi,  the  value  £i{w)  at  w  is  strongly  infeasible. 

The  “check”  algorithm  is  described  below.  The  input  is  a  (2fc_1, 2fc]-big  value 
of  x,-  along  with  the  certificate  G,-. 

Algorithm  5.5.13  [check] 

i.  For  each  vertex  v  €  Gi,  compute  the  big  value  £(v)  (as  in  Remark  5.5.12). 

ii.  Initialize  the  values  of  , . . . ,  and  Xj  ,...,xn  as  follows.  If  Vj  €  Gi,  set 

Xj  —  otherwise,  if  Vj  &  Gi,  set  Xj  =  —00.  If  Vj  €  Gi,  set  Xj  =  Z{y_j)] 

otherwise,  if  Vj  £  Gi,  set  Xj  =  00. 

iii.  Perform  2fc  push  phases. 

iv.  Make  conclusions  as  follows.  If  for  some  value  7)  we  have  ri  >  ~Sj  (resp.,  tj  <  Xj), 
conclude  that  t)  >  xj*“*  (resp.,  ij  <  x^*). 
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The  initial  values  at  the  vertices  are  consistent  (see  Remark  5.5.12),  and  henr  '  (see 
Corollary  5.2.14).  the  final  values  are  consistent.  It  follows  that  the  conclusions  made 
by  the  “check”  algorithm  are  correct. 

Definition  5.5.14  If  in  step  iv  of  Algorithm  5.5.13  we  deduce  that  a  value  rj  is  big, 
we  say  that  £,•  2k  -locates  tj. 

5.5.4  Properties  of  the  “check”  procedure 

This  subsection  establishes  the  following  theorem: 

Theorem  5.5.15  Consider  a  collection  of  b  (2fc_1, 2k)~big  values.  At  least  half  the 
values  ha*  t  the  following  property.  If  a  “check”  is  applied  to  any  of  them,  it  results 
in  2k-locating  at  least  b2k~l /(6n))  other  values  from  the  collection. 

The  following  proposition  analyzes  the  dependencies  among  values  of  different 
variables. 

Proposition  5.5.16  Let  &  and  (j  be  big  values  of  Xi  and  Xj,  respectively.  Let  the 
respective  certificates  Gi,  Gj,  and  the  nodes  V{,  Vj  be  as  in  Remark  5.5.12.  Suppose 
that  |(?i|  <  l,  |Gj|  <  l  for  some  l.  If  the  sets  of  vertices  participating  in  Gi  and  Gj 
intersect,  at  least  one  of  the  values  &  and  £j  is  l-located  by  the  other. 

Proof:  Suppose  u  €  V  (similar  foT  u  €  V_)  participates  both  in  G,-  and  in  Gj.  Let 
Pi  be  the  directed  path  in  G,-  from  v,  to  u  and  let  pj  be  a  directed  path  in  Gj  from  Vj 
to  u.  (see  Figure  5.5  for  an  illustration).  We  claim  that  if  fPi{(i)  <  fPj{ij)  then  & 
^-locates  (j]  otherwise,  (j  ^-locates  &.  Assume  that  fPi((i)  <  fPj((j),  and  consider 
an  application  of  “check”  to  £,.  The  “check”  algorithm  determines  that  fPi{(i)  is  a 
strongly  infeasible  value  of  u.  The  algorithm  initializes  ti_1  to  the  consistent  value 
fpi(ti)  and  applies  l  push  phases.  Consider  the  path  pj 1  from  ti-1  to  vj\  The 
value  at  vj1  after  at  most  l  push  phases  must  be  tighter  than  (j  =  fp-  {fPi{£i))- 
Note  that  is  tighter  than  (j  at  v“l,  since  (j  =  fp-^Upj((j))  and  fPj((j)  is  tighter 
than  fpi(ti)  at  it-1.  Hence,  ^-locates  (j.  § 
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Figure  5.5:  Dependence  of  intersecting  certificates 

Definition  5.5.17  Consider  a  pool  of  b  (2fc-1,2fc]-big  values.  The  influence  graph 
of  this  pool  is  defined  as  follows.  Each  big  value  in  the  pool  corresponds  to  a  vertex 
in  the  influence  graph.  If  one  value  2fc -locates  another,  then  there  is  a  directed  edge 
from  the  vertex  of  the  former  to  the  vertex  of  the  latter. 

Theorem  5.5.15  is  an  immediate  corollary  of  the  following  theorem: 

Theorem  5.5.18  In  the  influence  graph  of  a  pool  of  b  (2k~1 ,2k]-big  values,  at  least 
half  of  the  vertices  have  an  out-degree  greater  than  \b2k~l  /n  —  1. 

We  first  prove  two  propositions  which  are  used  in  the  proof  of  the  theorem. 

Proposition  5.5.19  In  the  intersection  graph  of  b  subsets  o/{l,...,n}  with  cardi¬ 
nality  l,  the  sum  of  the  degrees  is  at  least  b2l/n  —  b. 

Proof:  Let  Si  (i  =  1, ...,»)  be  the  number  of  subsets  containing  i.  Obviously, 
Si  =  bt.  The  sum  of  the  degrees  plus  the  number  of  vertices  (subsets)  is  at  least 
£7=1  *i/l'  This  expression  is  minimized  when  a,-  €  {[M/nJ,  \bl/n ]}  (*  =  1, . . .  ,n). 
It  follows  that  the  sum  of  degrees  is  at  most  \b2t/n\  —  b.  a 

Proposition  5.5.20  In  any  orientation  of  the  edges  of  the  intersection  graph  of  b 
subsets  of  {l,...,n}  with  cardinality  l,  at  least  half  the  vertices  have  out-degrees 
greater  than  \bl/n  —  1. 
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Proof:  It  follows  from  Proposition  5.5.19  that  the  sum  of  out-degrees  (i.e..  the  total 
number  of  edges)  in  any  subgraph  induced  by  b'  vertices  is  at  least  |( bni‘<n  —  b'). 
Suppose,  to  the  contrary,  that  there  exists  a  subset  U  of  cardinality  \U\  =  b'  =  b/2 
where  the  out-degree  of  each  vertex  is  less  than  or  equal  to  \bl/n  —  1.  The  total 
number  of  edges  (which  is  the  same  as  the  sum  of  out-degrees)  in  the  subgraph 
induced  by  these  vertices  is  \bnljn  —  b',  hence  a  contradiction  since  it  is  less  that 
i {b'H/n-b').  , 

Proof  of  Theorem  5.5.18:  The  cardinality  of  a  certificate  of  a  (2fc_1, 2fc]-big 
value  is  greater  than  or  equal  to  £  =  2fe_1.  It  follows  from  Proposition  5.5.16  that 
if  two  big  values  have  intersecting  certificates,  then  in  the  influence  graph  there  is 
an  edge  between  the  corresponding  vertices.  Hence,  the  influence  graph  contains 
an  intersection  graph  of  b  subsets  of  {l,...,n}  of  cardinality  l.  Thus,  the  proof 
follows  from  Proposition  5.5.20.  i 

5.5.5  Estimating  the  number  of  big  values 

In  this  subsection  we  prove  the  following  proposition: 

Proposition  5.5.21  An  estimate  r"  such  that  1/2  <  r’/r  <.  2  with  probability  at 
least  1/2,  can  be  found  using  0((loglogn)2)  “ reveal "  operations. 

The  problem  of  computing  an  estimate  can  be  stated  in  the  following  terms.  We 
are  given  a  collection  of  n  stones,  where  r  of  them  axe  radioactive.  The  number 
r  is  not  known.  We  are  equipped  with  a  primitive  gauge  that  when  exposed  to 
a  set  of  stones  can  determine  if  at  least  one  of  them  is  radioactive  (i.e.,  “reveal” 
operation).  A  good  estimate  to  the  number  of  radioactive  stones  is  a  number  r"  such 
that  1/2  <  r’/r  <  2.  The  goal  is  to  compute  a  good  estimate  which  is  correct  with 
probability  at  least  1/2,  by  using  at  most  0((log  log  n)2)  gauge  readings.  We  first 
give  some  useful  propositions. 
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Proposition  5.5.22  Denote  by  p0(s)  the  probability  that  a  sample  (with  returns)  of 
size  s  contains  no  radioactive  stones  and  let  c  =  rs/h.  Under  these  conditions.  ( i ) 
Po(s)  <  e~c ,  and  (ii)  ifr/h  <  0.5,  po(s)  >  4-c. 

Proof: 


It  follows  that  po(s)  <  e  c  and  if  r/h  <  0.5,  po  >  4  c.  | 

Corollary  5.5.23  Suppose  pa(s)  is  known.  We  can  conclude  as  follows: 

i.  7/0.0625  <  po(s)  <  0.6,  then  1/2  <  c  <  2. 

ii.  If  0.0725  <  p0(s)  <  0.59,  then  0.53  <  c  <  1.89. 

Hi.  Ifpo(s)  <  0.0825,  then  c  >  1.8. 

iv.  If  po(s)  >  0.58,  then  c  <  0.544. 

Proposition  5.5.24  Let  B'  be  the  r.v.  which  corresponds  to  the  number  of  non¬ 
radioactive  samples  out  of  B  =  5000  [log  log  n]  random  samples  of  size  s. 

Prob  {\B' / B  —  po(s)\  >  0.01}  <  1/(2 [log  log n])  . 

Proof:  B'  is  the  number  of  successes  of  B  Bernoulli  trials  with  success  proba¬ 
bility  po(s),  and  hence,  has  a  binomial  distribution.  Therefore  B'  has  expected 
value  po(s)B  and  variance  Bpo(s)(l  -  po(s))  (see  [23]  for  background).  It  fol¬ 
lows  from  Chebyshev  inequality  that  Prob  {| B'  —  p0(^)^|  >  0.01B}  <  2500J S/B2  = 
2500 IB  =  1/(2  [log  log  n] ).  A  smaller  constant  can  be  achieved  by  using  the  normal 
approximation  [23].  | 

Denote  by  Po(s)  the  r.v.  B'/B,  and  by  Po(a)  a  value  of  Po(a). 

We  now  describe  the  algorithm  that  computes  the  estimate  r".  The  algorithm  is 
based  on  performing  a  binary  search  on  the  [lognj  possible  estimate  values  21,  22, 
24,  ...,2^1. 
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Algorithm  5.5.25  fCompute  an  estimate: 

i.  If  p(,(2)  5:  0.35,  stop  and  return  the  estimate  r“  =  2-iogn-. 

ii.  Initialize  a  =  1,  b  =  [lognj.  Repeat  the  following  until  a  =  b: 

iii.  Set  j  =  [(a  +  6)/2"|,  and  s  =  2J. 

•  If  0.0725  <  p{,(s)  <  0.59,  stop  and  return  the  estimate  r“  =  h/s. 

•  If  PM  <  0.0725,  set  b  =  j  and  go  to  step  iii. 

•  If  p^)  >  0.59,  set  a  =  j  and  go  to  step  iii. 

We  show  that  with  probability  at  least  0.5,  1/2  <  r"/r  <  2.  Consider  the  first 
step.  If  r  >  n/2,  po(2)  <  1/4.  Hence,  Prob{p'0(2)  <  0.35}  <  1/ log  log  re.  If  r  < 
n/4,  Po(2)  >  9/16.  Hence,  Prob{p'0( 2)  >  0.47}  <  1/ log  log  n.  It  follows  that  if 
r  >  n/2,  the  algorithm  stops  with  the  correct  estimate  with  probability  at  least 
1  —  1/(2  [log  log  n"().  If  r  <  n/4  the  probability  that  the  algorithm  stops  with  an 
incorrect  estimate  is  smaller  than  1/(2  [log  log  n]). 

The  algorithm  performs  log  log  n  iterations.  Each  iteration  terminates  with  further 
restriction  of  the  set  a,  a  +  1, . . .  ,b.  Consider  a  single  iteration.  It  follows  from 
Corollary  5.5.23  and  Proposition  5.5.24  that  if  at  the  beginning  of  the  iteration,  one  of 
the  values  2“,  2#+1, . . . ,  2fc  is  a  good  estimate,  then  with  probability  (1— 1/(2  [log  log  n] ) 
this  is  still  the  case  when  the  iteration  ends. 

Suppose  log  log  n  >  2.  The  probability  that  when  the  algorithm  terminates  2“  is 
a  good  estimate  is  at  least 

(1  -  1/(2 [log  log  n})^0*10**1  >  0.5  . 

The  algorithm  performs  O(loglogn)  iterations,  where  each  iteration  computes  an 
appropriate  value  of  p'0.  Each  iteration  requires  0(log  log  n)  “gauge  readings”  (see 
Proposition  5.5.24).  Hence,  the  total  number  of  “gauge  readings”  performed  by  the 
algorithm  is  O((loglogn)2). 
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5.6  Locating  a  pool  of  weakly  infeasible  values 
and  feasible  values 

In  this  section  we  present  an  algorithm  which  asserts  the  following: 

Theorem  5.6.1  Problem  5.3.6,  where  the  values  (i  £  I)  are  guaranteed  to  be 
either  feasible  or  weakly  infeasible,  can  be  solved  (i)  sequentially,  in  0(mn  log2  n) 
expected  time,  and  (ii)  on  a  CRCW  PRAM,  in  O(n\og2  n)  expected  time,  using  0{m ) 
processors. 

The  following  is  a  key  property  of  feasible  and  weakly  infeasible  values: 

Proposition  5.6.2  Let  &  and  £,•  be  feasible  or  weakly  infeasible  values  of  Xi  and  Xj, 
respectively.  If  there  exists  a  path  p  from  Vj  €  {vj,Vj}  to  €  {j^,  v,}  such  that  fp((j) 
is  tighter  than  &  at  the  vertex  Vi,  there  exist  such  a  simple  path  p' . 

Proof:  We  assume  that  Vj  —  Vj,  V{  =  Vi  (the  proof  for  the  other  cases  is  similar). 
Consider  the  path  p  from  Vj  to  V{.  Suppose  a  vertex  u  occurs  more  than  once  on 
p.  Consider  two  of  the  occurrences  of  u.  Let  p  =  PiPzPz,  where  p\  is  the  prefix 
of  p  until  the  first  occurrence  and  ps  is  the  suffix  of  p  starting  from  the  second 
occurrence  of  it.  If  fPlP3((j)  is  not  tighter  than  fPl((j)  at  it,  the  path  p'  =  pipz  is 
at  least  as  tight  as  p.  Consider  iterating  the  above  process.  It  follows  that  there 
exist  a  path  p  from  Vj  to  Vj  which  is  at  least  as  tight  as  p  with  respect  to  Xj  = 
and  when  a  vertex  appears  more  than  once,  the  value  at  subsequent  occurrences  is 
always  tighter.  We  claim  that  each  vertex  may  occur  at  most  once  in  p.  Assume 
the  contrary,  and  consider  the  first  cycle  c  along  p.  Let  it  be  the  vertex  where  c 
starts  and  ends,  and  let  p'  =  p\C  be  the  corresponding  prefix  of  p.  It  follows  from 
Proposition  5.2.16  that  either  fpi  ((j)  is  infeasible  at  u,  or  fPl  (£,-)  is  consistent  at  u. 
If  fPi  (£j)  is  infeasible,  p'  is  a  closed  certificate  for  [— oo,  £3]  with  respect  to  Xj  and  we 
get  a  contradiction  to  the  assumption  that  (j  is  not  strongly  infeasible.  Otherwise, 
fn  (£3)  is  consistent  at  tt.  It  follows  that  for  every  prefix  P2  ol  p  which  contains  p\ 
and  ends  at  a  vertex  u',  the  value  fPi  (£,■)  is  consistent  at  u' .  Consider  the  last  cycle 
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c'  along  p.  and  let  p'  =  c'p3  be  the  corresponding  suffix  of  p.  It  follows  that  p'  is  a 
closed  certificate  for  [£,oo]  at  i\.  This  is  a  contradiction  to  &  not  being  strongly 
infeasible.  | 

We  present  two  procedures  which  are  subroutines  of  the  algorithm  for  locating  a 
pool.  The  first  procedure  is  applied  to  a  subset  of  the  values  and  returns  a  partial 
solution.  The  second  procedure  extends  a  partied  solution  as  much  as  possible  without 
restricting  the  feasible  region  further.  We  discuss  the  first  procedure  in  detail.  The 
input  is  a  subset  of  values  £,  (i  £  /'),  where  I'  C  I ■  The  procedure  determines 
intervals  7;  ( i  £  I"),  where  I"  C.  I'  is  a  subset  of  the  values  in  the  input  set.  The 
intervals  J{  (i  £  I")  constitute  a  partial  solution  for  Problem  5.3.6,  that  is,  there  exist 
a  set  of  intervals  J,  (i  £  7  \  I")  such  that  Ji  (t  £  I)  is  a  solution  of  Problem  5.3.6. 

Algorithm  5.6.3  [Find  a  partial  solution] 

i.  Initialize  the  values  at  the  vertices  of  G  as  follows: 

=  ti  (i  €  =  bi,  =  ai  (t  g  /'), 

ii.  Apply  2n  push  phases. 

iii.  For  i  £  7'  do  as  follows: 

•  If  (i  =  Xi  =  Xi,  set  Ji  =  {&}. 

•  If  =  Xi  <  set  Ji  =  [&,  oo]. 

•  If  ti  -  >  Xi,  set  Ji  =  [-oo,6]. 

•  Otherwise,  Ji  is  not  determined. 

Let  7"  C  7'  be  the  set  of  all  t  £  7'  for  which  Ji  is  determined.  For  t  £  I"  do: 

Si  <-  Si  D  Ji  . 

Denote  by  F‘  the  feasible  region  (1  <  *  <  n)  prior  to  applying  Algorithm  5.6.3. 
Consider  an  application  of  the  algorithm.  We  show  that  if  Fm  j*  0  then  the  system  re¬ 
mains  feasible,  that  is,  F  ^  0.  The  following  proposition  proves  a  necessary  condition 
for  correctness:  If  F“  ^  0,  then  for  all  t  £  I",  F’  fl  {*|*i  €  Ji}  ^  0. 
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Proposition  5.6.4  Suppose  x{  =  (resp.,  Xi  =  G)  weakly  infeasible.  There  exist 
a  simple  path  p  from  to  Vi  (resp.,  vt  to  y^J,  such  that  fp((i)  <  £,  (resp.,  fp(£t)  >  C)- 

Proof:  It  suffices  to  prove  the  existence  of  a  not  necessarily  simple  path  p  with 
the  above  properties.  The  existence  of  a  simple  path  would  follow  from  Propo¬ 
sition  5.6.2.  We  assume  xt  =  £,  is  weakly  infeasible  (the  other  case  is  proved 
similarly).  There  exist  1  <  j  <  n  and  two  simple  paths  emanating  from  y^,  pi  to 
Vj  and  p2  to  Vj  such  that  /*  Ui)  <  /»(&)•  Consider  the  path  p  =  pxp2  1  from  y ,•  to 
Vi .  We  claim  that  fp((i)  <  &.  To  prove  the  claim  note  that  fp{G)  =  fp-'  0  /P, (£,). 
Since  the  end  vertices  of  p2  1  both  lie  in  V,  the  function  fp- 1  is  monotone  increasing. 
Hence,  fp- .  o  fK(&)  <  fp~ »  o  /w(&)  =  &.  , 

We  show  that  the  intervals  J;  (t  £  I")  constitute  a  partial  solution: 

Proposition  5.6.5  F  =  F~  fl  {*|  AieJ"  *»  €  J»}  ^  0. 

Proof:  It  follows  from  Proposition  5.6.4  that  J,  is  feasible  for  all  i  £  I".  It  follows 
from  Proposition  5.2.11  part  iii  that  it  suffices  to  show  that  for  every  pair  {i.j}  €  I", 
F~  n  {*|®i  £  Ji,  Xj  £  Jj}  7^  0.  The  latter  follows  from  Corollary  5.2.10.  ■ 

We  discuss  the  procedure  which  extends  a  partial  solution.  Suppose  we  applied 
Algorithm  5.6.3  and  as  a  result,  for  i  £  I",  we  updated  the  intervals  S<  and  determined 
Ji.  The  following  algorithm  may  determine  the  interval  Ji  for  some  additional  values 

i  €  /  \  I". 

Algorithm  5.6.6  [Extend  a  partial  solution] 

i.  Initialize  the  values  at  the  vertices  of  G  as  follows,  x,-  =  a i  and  xt-  =  bi  (t  £  I). 

ii.  Apply  2n  push  phases. 

iii.  For  *  €  I  \  V  conclude  as  follows: 

•  If*i<  (i,  «et  Ji  =  {- oo, & }. 

•  If  Si  >  6»  «et  Ji  =  {&,  oo}. 
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Let  I"  be  the  set  of  €  /  \  /'  such  that  was  determined. 

For  i  €  I"  do:  Si  *—  Si  H  J,. 

Remark  5.6.7  Proposition  5.2.5  asserts  that  the  feasible  region  is  not  changed  by 
the  updates  of  the  intervals  Si  done  by  Algorithm  5.6.6.  In  particular,  if  the  feasible 
region  was  non  empty  prior  to  applying  the  algorithm  it  remains  non  empty. 


5.6.1  The  algorithm 

The  following  algorithm  considers  feasible  and  weakly  infeasible  values  £,  (i  €  /),  and 
solves  Problem  5.3.6  for  these  values.  The  solution  consists  of  a  set  of  intervals  J, 
(t  £  I).  The  algorithm  performs  iterations,  where  in  each  iteration  some  intervals 
Ji  (t  €  I'  C  I)  axe  determined.  The  intervals  Ji  (t  €  I')  are  incorporated  into  the 
system,  we  set  I  *-  I  \/',  and  the  problem  is  reduced  to  locating  the  values  &  (*  €  I). 

Algorithm  5.6.8  [Locate  a  pool] 

i.  Repeat  the  following  until  7  =  0. 

ii.  For  £  =  1,2,4,...,  2l1°BnJ ,  execute  steps  iii-v: 

iii.  Choose  (with  returns)  l  random  elements  from  I.  Let  I'  C  I  be  the  set  of 
chosen  elements. 

iv.  Apply  Algorithm  5.6.3  to  I'.  The  algorithm  determines  Ji  for  »  6  I"  C  I'. 

v.  Apply  Algorithm  5.6.6  and  determine  Ji  for  additional  values  /  €  I.  Set  I  *— 

I  \  (I"  U  /}. 

It  follows  from  Proposition  5.6.5  and  Remark  5.6.7  that  if  the  problem  was  feasible,  it 
remains  feasible  in  any  point  during  the  execution  of  the  algorithm,  hence,  when  the 
algorithm  terminates,  the  intervals  J,-  (*  €  /)  constitute  a  solution  for  Problem  5.3.6. 
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5.6.2  Complexity 

For  values  6  (i  €  I),  consider  a  solution  for  Problem  5.3.6,  that  is,  intervals  J,  ( i  6  I) 
such  that  Ji  €  {[-oc,  &},  [£,  oo]}  and  Xiei  Ji  H  F;  ^  0. 

We  designate  vertices  u,  (i  €  I)  according  to  the  choice  of  intervals  J,  (i  £  /):  If 
Ji  =  [-oo,6],  If  Ji  =  [6,  oo],  Vi  =  2U- 

We  define  a  relation  on  the  set  I,  with  respect  to  the  solution  J,  (t  €  I): 
Definition  5.6.9  Let  i  €  I,  j  €  I  be  two  elements. 

i.  We  say  that  j  locates  i  if  there  exist  a  path  p  from  Vj  to  v,  such  that  fp(£j)  is 
at  least  as  tight  as  6  on  v;. 

ii.  We  say  that  i  interferes  with  j  if  there  exists  a  path  p  from  t\-1  to  vf1  such 
that  fp(£i)  is  tighter  than  (j  on  v~l. 

iii.  We  define  the  relation  <  on  /  as  follows,  j  •<  i  if  and  only  if  i  interferes  with  j. 
(Note  that  when  i  ^  j,  i  locates  j  if  and  only  if  j  interferes  with  i.) 

iv.  For  an  element  i  €  I,  denote  by  pred(»)  =  {j  6  I\j  <  *}  the  set  of  elements 
that  locate  i,  and  by  succ(t)  =  {j  €  I\j  >-  *}  the  set  of  elements  which  interfere 
with  *. 

The  following  is  a  corollary  of  Proposition  5.6.2: 

Corollary  5.6.10  If  a  path  p  as  in  Definition  5.6.9  parts  i  and  ii  exists,  then  there 
must  exist  a  simple  path  with  the  same  properties. 

The  following  proposition  relates  the  locate  and  interfere  relations  to  the  algo¬ 
rithms  presented  earlier.  The  proof  follows  from  Corollary  5.6.10. 

Proposition  5.6.11  Let  i  €  I,  j  €  J  be  two  elements. 

i.  Suppose  that  j  locates  i.  If  Si  c  Ji,  then  an  application  of  Algorithm  5.6.6 
would  determine  Ji. 
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it.  Suppose  that  i  interferes  with  j.  Consider  an  application  of  Algorithm  5.6.3 
where  {i,j}  C  I'-  The  interval  Jj  will  not  be  determined  as  a  result  of  such  a 
run.  Moreover,  if  j  6  I',  the  interval  Jj  is  determined  in  this  run  if  and  only  if 
for  all  values  i  €  I' ,  i  does  not  interfere  with  j . 

We  characterize  the  values  located  in  an  iteration  of  Algorithm  5.6.8,  in  terms  of 
pred  and  succ: 

Corollary  5.6.12  Consider  an  execution  of  steps  iv-v.  An  interval  Jk  (fork  €  I)  is 
determined  as  a  result  if  and  only  t/ pred(fc)  fl  /'  ^  0  and  succ(fc)  fl  /'  =  0. 

Proposition  5.6.13  The  relation  ■<  is  a  partial  order  on  I . 

Proof:  The  transitivity  of  -<  is  immediate.  We  need  to  show  that  -<  has  no  cycles. 
If  there  is  a  cycle,  it  follows  from  the  transitivity  that  for  some  j  €  I,  j  interferes 
with  j.  The  path  p  is  in  this  case  a  simple  cycle  which  constitutes  a  closed  certificate 
for  (j.  This  contradicts  the  assumption  that  (j  is  not  strongly  infeasible.  ■ 

Note  that  each  iteration  of  steps  iii-v  results  in  determining  at  least  one  interval, 
and  therefore,  the  algorithm  terminates  within  0(n )  iterations.  In  order  to  prove 
Theorem  5.6.1,  however,  we  need  to  show  that  the  expected  number  of  iterations  is 
only  0(log2  n).  We  first  discuss  the  chances  of  an  interval  Ji  (t  €  I)  to  be  determined 
in  an  iteration: 

Proposition  5.6.14  Consider  an  execution  of  steps  iii-v.  If  an  element  i  €  /  is  such 
that  |succ(t)|  =  0,  then  Ji  is  determined  with  probability  at  least  po,  where  po  >  0 
is  some  constant,  at  the  steps  where  t  >  |/j/|  pred(*)|.  Otherwise,  t/|pred(t)|  > 
|succ(t)|/3,  then  Ji  is  determined  with  probability  at  least  po  at  the  step  where 

i'i  ^  f  ^  w 

|succ(i)j  |succ(t)| 
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Proof:  We  prove  the  part  where  |  succ(i)j  >  0.  The  first  part  is  similar  and  simpler. 
It  follows  from  Corollary  5.6.12  that  Ji  is  determined  with  probability 

P  =  Prob  {pred(i)  H  I'  ^  0  A  succ(i)  fl  I1  =  0}  . 


We  need  to  show  that  there  exist  a  constant  p0  such  that  P  >  p0.  Since  the  sets 
succ(t)  and  pred(i)  are  disjoint,  it  suffices  to  show  that 

Pi  =  Prob  (pred(i)  fl  /'  ^  0}  >  pi,  and  P2  =  Prob  {succ(i)  n  I'  =  0}  >  p2, 

for  some  constants  pi  >  0,  p2  >0. 

Consider  l  as  above.  It  follows  that  P2  >  0.06  and  Pi  >  0.28.  | 


Proof  of  Theorem  5.6.1:  Algorithm  5.6.8  terminates  with  a  correct  solu¬ 
tion.  We  claim  that  the  expected  number  of  iterations  of  step  ii  is  0(log  n).  Each 
execution  of  step  ii  amounts  to  0(log  n)  calls  to  Algorithms  5.6.3  and  5.6.6,  and 
hence,  Algorithm  5.6.8  amounts  to  an  expected  number  of  0(log2  n)  applications  of 
Algorithms  5.6.3  and  5.6.6.  Recall  that  Algorithms  5.6.3  and  5.6.6  run  sequentially, 
in  0(mn)  time,  and  on  a  CRCW  PRAM,  in  0(n)  time  using  0(m)  processors. 
It  follows  that  the  expected  running  time  of  Algorithm  5.6.8  is  as  stated  in  the 
theorem.  What  remains  is  to  prove  the  claim.  Consider  the  quantity 

s{I)  =  Yi  I  Pred(0l  *  1*1  +  E  I  succ(t)|  . 

»€/  «€/ 

Initially,  s  <  n2.  It  suffices  to  show  that  an  execution  of  step  ii  reduces  s  by  at 
least  a/7,  with  probability  at  least  1/2.  If  a  <  7|/|/4, 

E  |pred(*)|  >  a/7  . 

{t:|*ucc(t)|=0} 

Otherwise,  if  a  >  7|J|/4,  we  have  Li€/  l6UCC(*)i  ^  352i€/  jpred(t)|/7,  and  hence, 

E  I  Pred(*)l  >  2*/7  • 

{«:|  pred(i)l>  |  «uee(i)|/3} 

The  proof  follows  from  Proposition  5.6.14.  ■ 
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5.7  Monotone  systems 

Consider  the  following  two  operations  on  vectors  in  R71: 

x\/y  =  (max{i1,yi},...)max{in,2/n}):r  (the  join  of  x  and  y) 

x/\y  =  (min{®i,yi}, . . .  ,min{x„,yn})  (the  meet  of  x  and  y)  . 

Consider  a  TVPI  system  as  in  Definition  5.1.1,  where  the  matrix  AT  (resp.,  —  Ar) 
is  a  pre-Leontief  matrix,  that  is,  each  row  of  A  contains  at  most  one  positive  (resp., 
negative)  entry.  The  set  of  solutions  of  such  a  system  constitutes  a  semilattice  rel¬ 
ative  to  the  V  (resp.,  A)  operation  (see  Cottle  and  Veinott  [15]).  It  follows  that  if 
all  the  variables  are  bounded  from  above  (resp.,  below),  there  exists  a  solution  where 
all  the  variables  are  maximized  (resp.,  minimized)  simultaneously,  hence,  the  vec¬ 
tor  (resp.,  x mm)  is  feasible.  Also,  the  associated  graph  of  such  a  system  (see 
Definition  5.2.1)  does  not  contain  edges  from  V  to  V  (resp.,  from  V  to  V). 

A  TVPI  system  where  the  two  non-zero  coefficients  in  each  inequality  have  op¬ 
posite  signs  (equivalently,  both  AT  and  ~AT  are  pre-Leontief)  is  called  monotone. 
The  set  of  solutions  of  monotone  systems  constitutes  a  lattice  relative  to  the  V  and 
A  operations.  Note  that  the  converse  is  also  true:  if  a  polyhedral  set  constitutes 
a  lattice  relative  to  the  V  and  A  operations,  then  it  can  be  represented  as  the  set 
of  solutions  of  some  monotone  system.  The  converse  is  a  direct  consequence  of  the 
following  result  due  to  Veinott  [61]:  a  polyhedral  set  is  a  semilattice  relative  to  the 
V  (resp.,  A)  operation  if  and  only  if  it  constitutes  the  set  of  solutions  of  some  linear 
system  Ax  <  b  (resp.,  Ax  >  6),  where  the  matrix  AT  (resp.,  —A7)  is  pre-Leontief. 
Consider,  for  example,  the  non-monotone  system  in  Figure  5.6.  The  points  (5, 1)  and 
(3,4)  are  feasible,  but  the  point  (5,1)  A  (3,4)  =  (3,1)  is  not.  Monotone  systems  are 
related  to  generalized  network  flow  problems:  the  linear  programming  dual  of  the  un- 
capadtated  generalized  transshipment  problem  is  monotone.  In  Chapter  6  we  present 
algorithms  for  generalized  network  flow  problem  that  exploit  this  relationship. 

The  algorithms  presented  here  for  solving  the  feasibility  of  TVPI  systems  can 
be  adapted  to  find  the  vectors  scmm,  1®“  for  monotone  systems.  Note  that  in  the 
assodated  graph  of  a  monotone  system  there  are  no  edges  passing  between  the  two 
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y  -  3x  ^  0  3y  -  4x  ^  0 

8y  -  3x  <  42  5y  +  x  ^  46 

-y  +  x  £  1  -y  +  x  <  4 

-y  +  x  <  -3  -2y  +  3x  £  -7 

y  y 


monotone  system  non-monotone  system 

Figure  5.6:  Examples  of  a  monotone  system  and  a  non-monotone  system 

sets  of  vertices  V  and  V.  It  follows  that  all  certificates  are  either  closed  certificates 
or  simple  paths,  and  all  infeasible  values  must  be  strongly  infeasible. 

We  consider  an  application  of  Algorithm  5.3.3  to  a  monotone  system,  with  the 
goal  of  finding  the  point  a:m,“.  It  ruffices  to  rely  on  a  simpler  notion  of  locating  values: 
locating  the  value  £  means  locating  it  with  respect  to  xf“*  (rather  than  determining 
its  position  relative  to  xj"”  as  well).  Problem  5.3.6  amounts  to  the  following: 

Problem  5.7.1  [Locate  a  pool  of  values]  Given  are  values  &  (i  €  I)  for  the  corre¬ 
sponding  variable  X,  (»  €  J).  Locate  these  values,  i.e.,  determine  if  &  >  xf“  (*  €  /)• 
Choose  intervals  (t  €  /)  as  follows.  J*  =  [&,  oo],  if  &  <  xf“*  and  J;  =  [— oo,&],  if 
(i  >  (*'  e  i). 

It  is  easy  to  see  that  the  intervals  Ji  are  such  that  {*|A»€J  x*  €  Ji)  contains  the 
simultaneous  maximum. 

Consider  step  vii  of  Algorithm  5.3.3.  The  interval  5,-  are  feasible  for  (1  <  t  <  n). 
Hence,  for  1  <  i  <  n,  b[  =  xf1**  (xj  is  unbounded  if  and  only  if  b\  =  oo).  If  all  variables 
are  bounded,  b'  =  i”“.  Otherwise,  the  algorithm  supplies  dependencies  that  allow 
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us  to  compute  feasible  solutions  when  we  determine  some  variables  and  maximize 
others. 


Remark  5.7.2  An  algorithm  which  computes  a;mil3t  (resp.,  xmin)  for  feasible  mono¬ 
tone  systems  can  be  applied  to  compute  xmnx  (resp.,  a:1™11)  for  TVPI  systems  where 
the  matrix  AT  (resp.,  —AT)  is  pre-Leontief.  Consider  a  system  Ax  <  b  where  at 
most  one  of  the  two  nonzero  coefficients  in  each  row  is  positive.  Obtain  a  monotone 
system  A'x  <  b'  by  omitting  all  rows  of  A  where  there  are  two  negative  entries. 
Compute  the  vector  y  wrhich  maximizes  all  variables  in  the  resulting  monotone  sys¬ 
tem.  If  the  original  system  Ax  <  b  is  feasible,  y  is  the  vector  which  maximizes  all 
variables. 


5.8  Concluding  remarks 

In  this  chapter  we  considered  systems  of  linear  inequalities,  where  each  inequality  has 
at  most  two  nonzero  coefficients  (TVPI  systems).  We  presented  algorithms  which 
solve  the  feasibility  problem,  that  is,  either  find  a  point  which  satisfies  all  the  in¬ 
equalities  or  conclude  that  no  such  point  exists.  We  gave  a  0(mn2)  deterministic 
algorithm  and  a  0(n 3  +  mn)  expected  time  randomized  algorithm.  The  complexities 
of  the  respective  parallel  implementations  are:  0(n)  time  using  0(mn)  processors, 
and  O(n)  expected  time  using  0(n 2  4-  m)  processors.  Although  the  analysis  of  these 
algorithms  seems  quiet  lengthy,  the  algorithms  themselves  are  simple.  The  underlying 
computation  amounts  the  basic  Bellman-Ford  and  Floyd- Warshall  [13]  shortest  path 
algorithms  where  only  data  structures  are  used. 

We  give  some  comments  and  suggestions  for  further  research: 

The  time  complexity  of  the  randomized  algorithm  involves  many  logarithmic  fac¬ 
tors  (log*  n).  We  believe  that  a  more  careful  analysis  may  eliminate  some  of  these 
factors.  Another  obvious  question  is  whether  the  0(n3  +  mn)  bound  can  be  achieved 
deterministically. 
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We  discuss  the  possibility  of  improving  over  a  0(n3  —  mn)  bound.  The  0(n3 ) 
factor  results  from  the  all-pairs  shortest  part  Floyd- Warshall  computation  and  is  in¬ 
herent  from  our  basic  framework.  A  different  approach,  however,  may  yield  a  0(mn ) 
algorithm. 

The  feasibility  problem  of  monotone  systems  includes  as  a  very  special  case  the  prob¬ 
lem  of  detecting  existence  of  negative- weight  directed  cycles  in  a  graph  with  n  nodes, 
m  edges,  and  real  weights  associated  with  the  edges.  The  best  known  bound  for  de¬ 
tecting  negative  weight  cycles  is  0(mn),  and  hence,  we  believe  it  is  improbable  that  a 
o(mn)  algorithm  exists  for  solving  TVPI  systems.  We  sketch  the  reduction.  Consider 
a  weighted  graph  G  =  (V,E,w),  where  w  :  E  — ►  R.  The  corresponding  monotone 
system  is  as  follows.  For  each  node  v  £  V  assign  a  variable  xv.  For  each  edge 
e  =  (u,v)  £  E  assign  the  inequality  xv  —  xu  <  w(e).  It  follows  from  Proposition  5.2.9 
that  G  contains  a  negative  weight  cycle  if  and  only  if  the  system  is  infeasible. 

Our  algorithms  solve  the  feasibility  problem  of  TVPI  systems.  They  can  be 
adapted,  however,  to  (i)  find  a  solution  which  maximizes  a  specific  variable  (ii)  and 
find  the  lexicographic  maximum.  Questions  that  remain  open  regard  finding  an  op¬ 
timal  solution  relative  to  an  arbitrary  linear  objective  function.  It  is  not  known 
whether  a  strongly  polynomial  time  algorithm  exists  for  the  problem,  or  whether  we 
can  achieve  a  time  bound  which  is  better  than  specializations  of  existing  general  LP 
algorithms.  A  partial  result  is  due  to  Cosares  [14]  who  showed  that  when  the  objective 
function  has  a  fixed  number  of  nonzero  entries,  the  problem  can  be  solved  in  strongly 
polynomial  time  bounds.  The  degree  of  the  polynomial,  however,  grows  linearly  with 
the  number  of  nonzero  entries. 

We  discuss  the  parallel  complexity  of  solving  TVPI  systems.  Lueker,  Megiddo, 
and  Ramachandran  [43]  showed  that  the  problem  of  finding  an  optimal  solution  rela¬ 
tive  to  a  general  objective  function  is  "P-complete.  It  is  not  known,  however,  whether 
the  feasibility  problem  is  V-  complete.  The  algorithms  presented  in  this  chapter  have 
a  parallel  running  time  of  0{n).  These  are  the  best  known  parallel  time  bound 
achievable  by  a  polynomial  number  of  processors.  Lueker,  Megiddo,  and  Ramachan¬ 
dran  [43]  gave  an  algorithm  for  the  problem  which  runs  in  polylogarithmic  time,  but 
uses  processors. 


Chapter  6 

Algorithms  for  generalized 
network  flows 

A  generalized  network  is  a  digraph  G  =  (V,  E)  given  together  with  positive  flow 
multipliers  ae  (e  €  E)  associated  with  the  edges.  The  multiplier  ae  (e  6  E)  is 
interpreted  as  a  gain  factor  (when  a*  >  1)  or  a  loss  factor  (when  ae  <  1)  of  flow  along 
the  edge  e;  when  xe  units  of  flow  “enter”  the  edge  e,  aexe  units  “leave”.  Generalized 
network  flows  are  also  known  in  the  literature  as  flows  with  losses  and  gains.  They 
can  be  used  to  model  many  situations  that  arise  in  financial  analysis  [25,  26,  42]. 

The  uncapacitated  generalized  transshipment  problem  (UGT)  is  defined  on  a  gen¬ 
eralized  network,  where  costs  are  given  for  the  edges  and  supplies  or  demands  are 
given  for  the  nodes.  The  goal  is  to  find  a  flow  of  minimum  cost,  which  satisfies  the 
supplies/demands.  Adler  and  Cosares  [1]  gave  an  algorithm  for  solving  restricted 
instances  of  UGT  where  there  are  many  sources  and  no  sinks.  Their  algorithm  is 
based  on  a  solution  for  the  linear  programming  dual,  which  turns  out  to  be  a  mono¬ 
tone  TVPI  system,  and  hence,  the  results  of  Chapter  5  imply  better  time  bounds  for 
restricted  UGT. 

In  the  generalized  circulation  problem  (GC)  we  consider  a  generalized  network 
where  demands  (nonnegative  numbers)  are  given  for  the  nodes  and  capacity  con¬ 
straints  are  given  for  the  edges.  The  goal  is  to  find  a  feasible  flow  which  maximizes 
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the  proportion  of  satisfied  demands.  Goldberg.  Plotkin,  and  Tardos  [26  presented 
an  algorithm  for  the  seemingly  more  general  capacitated  generalized  transshipment 
problem  without  costs.  Their  algorithm  is  based  on  solving  an  instance  of  GC  which 
has  a  single  demand  node  (the  source )  and  performs  0(mn )  additional  computation. 
We  present  a  scheme  for  solving  generalized  circulation  problems  by  iteratively  relax¬ 
ing  the  capacity  constraints,  solving  an  instance  of  UGT  on  the  same  network  with 
costs  which  “capture”  the  capacities,  scaling  the  flow  to  a  feasible  one,  and  replacing 
the  capacities  by  the  residual  capacities  relative  to  this  flow. 

This  scheme  introduces  a  general  method  of  approximating  a  solution  to  linear 
programming  problems  in  the  following  situation.  For  given  matrices  A  £  iT*xm, 
U  €  Rlxm  and  vectors  b  €  R'1,  d  €  Rt,  where  U,  d  have  nonnegative  entries, 
maximize  t  subject  to:  (i)  Ax  =  tb,  x  >  0,  and  (ii)  Ux  <  d.  Condition  (ii)  can  be 
viewed  as  generalized  capacity  constraints.  We  assume  that  for  c  >  0,  it  is  “easy” 
to  minimize  cTx  subject  to  Ax  =  b,  x  >  0.  Denote  by  t~  the  maximal  value  of  the 
objective  function.  We  will  show  that  a  feasible  solution  *'  €  R'1 ,  t'  €  R  such  that 
t'  >  t~ /£  cam  be  found  by  solving  a  single  instance  of  the  “easy”  problem. 

Consider  the  generalized  circulation  problem  with  the  relaxed  goal  of  computing 
a  flow  which  satisfies  a  proportion  of  the  demands  which  approximates  to  a  constant 
factor  the  best  achievable  proportion.  For  the  relaxed  problem,  the  scheme  described 
above  yields  a  strongly  polynomial  time  algorithm,  which  is  also  the  fastest  known 
(on  some  ranges)  algorithm.  This  scheme  also  yields  an  algorithm  for  obtaining  an 
optimal  solution,  which  is  the  fastest  known  on  some  ranges. 

In  Section  6.1  we  define  the  UGT  problem  and  review  the  Adler  and  Cosares  [1] 
algorithm.  In  Section  6.2  we  introduce  the  approximation  algorithm  and  apply  it  to 
the  generalized  circulation  problem.  In  Section  6.3  we  introduce  bidirected  gener¬ 
alized  networks  and  discuss  the  UGT  and  generalized  circulation  problems  on  these 
networks.  Section  6.4  contains  concluding  remarks. 

Note  that  for  instances  of  the  problems  mentioned  above  we  need  to  consider  cases 
where  m  =  u;(n2).  The  algorithms  presented  here  handle  multiple  edges  within  the 
stated  time  bounds. 
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6.1  Generalized  transshipment  problem 

Problem  6.1.1  [Uncapacitated  Generalized  Transshipment  (UGT)] 

Given  are  a  generalized  network  G  =  (V,E),  edge-costs  ce  (e  6  E),  and  supplies  (or 
demands)  b,  (1  <  i  <  n)  for  the  nodes.  Find  a  nonnegative  flow  function  x  =  ( xe ) 
such  that 

for  every  i,  ^  ae xe  —  ^  xe  =  b;  , 

e€in(i)  «6out(») 

so  that  the  cost  YlezE0*1*  is  minimized. 

When  6j  >  0  (resp.,  b;  <  0),  we  refer  to  the  i’th  node  as  a  source  (resp.,  sink).  The 
linear  programming  dual  has  the  following  form.  Find  values  for  ,irn  which 

maximize  b;irj,  subject  to  the  monotone  inequalities 

for  all  e  €  E:  it j  -  aevj  <  c*  where  e  is  from  t  to  j  . 

In  this  section  we  consider  restricted  instances  of  UGT  where  there  are  either 
only  sources  (b  >  0)  or  only  sinks  ( b  <  0).  Adler  and  Cosares  [1]  proposed  a 
scheme  for  solving  a  subset  of  the  LP  problems  where  each  variable  appears  in  at 
most  two  inequalities.  In  particular,  the  scheme  is  applicable  to  restricted  UGT 
instances.  They  showed  that  these  instances  can  be  solved  using  a  single  application 
of  Megiddo’s  algorithm  for  TVPI  systems  [46].  An  application  of  the  faster  algorithms 
for  TVPI  systems  presented  in  Chapter  5  can  be  used  instead.  Hence,  restricted 
UGT  instances  can  be  solved  deterministically  in  0  (mn2 (login  -f  log2 n)y  time,  and 
in  0  ( n 3  log  n  +  mn(log  mlog3  n  +  log6  nfj  expected  time. 

We  characterize  the  problems  for  which  the  scheme  of  [1]  is  applicable.  Consider 
an  LP  problem,  V,  of  the  following  form,  minimize  cTx,  subject  to  Ax  =  b,  x  >  0, 
where  A  €  f2nxm  contains  at  most  two  non  zero  entries  in  each  column.  Denote 
by  x’  €  R"1  the  optimal  solution  of  V.  Note  that  the  LP  dual  of  V  amounts  to 
optimizing  an  arbitrary  objective  function  subject  to  a  TVPI  system.  The  scheme 
of  [1]  is  applicable  to  V  if  se“  =  Y,i:b& o  where  x^  >  0  maximizes  cTx  subject  to 

Ax  =  bje*. 
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We  sketch  the  ideas  used  in  their  scheme.  Denote  by  S  the  TYPI  constraints 
At-k  <  c.  Let  xj"111  (resp.,  7rtmax)  be  the  minimum  (resp.,  maximum)  value  of  x, 
subject  to  5. 

If  xjnBX  =  oo  (resp.,  xf“n  =  —  oo),  then  V  is  feasible  only  if  6,  <  0  (resp.,  b{  >  0). 

If  b,  7^  0,  a  vector  x^  as  defined  above  can  be  constructed  from  a  minimal  subset  of 
constraints  from  S  which  implies  (i)  x,  <  x“ax  if  6,  >  0,  or  (ii)  x,  >  xf1111  if  b{  <  0.  The 
edges  which  correspond  to  such  a  minimal  system  comprise  a  generalized  augmenting 
path  [29]  of  flow  to  the  z’th  node  (i.e.,  a  flow  generating  cycle  and  a  path  from  a  node 
on  the  cycle  to  the  z’th  node).  The  vector  is  obtained  by  considering  the  flow 
values  at  the  edges  when  pushing  flow  along  this  augmenting  path. 

It  is  easy  to  see  that  the  scheme  of  [1]  is  applicable  to  restricted  UGT  instances. 
Consider  a  UGT  instances  where  6  >  0  (the  arguments  are  similar  for  b  <  0).  The 
constraints  in  S  are  monotone,  and  therefore,  by  a  single  application  of  the  algorithm 
of  Chapter  5  we  determine  xf“*  for  i  =  1, . . .  ,n  (see  Section  5.7).  If  xf1"  <  oo,  the 
algorithm  also  computes  a  minimal  subset  of  constraints  from  5  which  asserts  that 
Xj  <  xf“*.  The  vector  x^  can  be  constructed  by  using  this  information.  Conclude 
as  follows.  If  x™ax  ss  oo  and  5,-  >  0,  the  UGT  system  is  not  feasible.  Otherwise, 
x*  =  Y*i-.bi>ox^  is  a  solution. 


6.2  Generalized  circulation 

Definition  6.2.1  Consider  a  generalized  network  G  =  (V,  E),  where  demands  bi  >  0 
(1  <  i  <  »)  are  given  for  the  nodes  and  capacities  dj  >  0  (possibly  dj  =  oo)  are 
given  for  the  the  edges. 

i.  A  generalized  flow  is  a  flow  function  x  =  (ie)  (xe  >  0)  which  satisfies  the 
following.  There  exist  a  scalar  t(x)  =  t  such  that  for  every  1  <  t  <  n, 
D«€in(t)  a«*«  “  E.eout(i)  *€  =  (the  flow  x  satisfies  a  proportion  i(x)  of  the 
demands). 

ii.  A  generalized  flow  is  feasible  if  x«  <  c,  for  all  edges. 
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Vaidya,  89  [60]  j  Q(n2m1-a  log(n7))  j 


!  Kapoor  and  Vaidya,  88  [36 

;  0(n2Smlb\og(n'))) 

|  Goldberg,  Plotkin  and  Tardos,  88  [26] 

0(n2m2  log  n  log2  7)  j 

Table  6.1:  Some  previous  results  on  generalized  circulation 
Problem  6.2.2  [Generalized  Circulation] 

Given  are  a  generalized  network,  demands,  and  capacities  as  above.  Find  a  feasible 
generalized  flow  x~  such  that  t(xm)  is  maximized.  Denote  V  =  t(x"). 

We  refer  to  t"  as  the  optimal  value.  A  feasible  generalized  flow  x  is: 
i.  optimal  if  i(x)  =  and  ii.  e-optimal  if  >  1  —  e. 

Vaidya  [60]  gave  an  0(n2m1B  log(n7))  time  algorithm  for  the  problem,  were  7  is 
an  upper  bound  on  the  numerators  and  denominators  of  the  capacities,  multipliers, 
and  costs.  Vaidya’s  bound  is  based  on  a  specialization  of  his  currently  fastest  known 
general-purpose  linear  programming  algorithm  and  relies  on  the  highly  impractical 
fast  matrix  multiplication  algorithms.  The  previously  fastest  known  algorithm,  due 
to  Kapoor  and  Vaidya  [36],  does  not  rely  on  fast  matrix  multiplication,  and  has  a 
bound  worse  by  a  factor  of  y/n.  A  new  result  by  Murray  [48]  is  based  on  a  different 
specialization  of  Vaidya’s  LP  algorithm  to  generalized  flow.  Murray’s  generalized 
circulation  algorithm  matches  the  bound  of  [60]  and  does  not  rely  on  fast  matrix 
multiplication.  The  algorithms  of  [36,  48,  60]  are  applicable  to  the  more  general  min- 
cost  generalized  flow  problem.  A  different  algorithm,  of  more  combinatorial  nature, 
was  given  by  Goldberg,  Plotkin  and  Tardos  [26].  These  results  are  summarized  in 
Table  6.1. 


6.2.1  A  generalized  circulation  algorithm 

The  algorithms  discussed  above  are  designed  to  And  an  optimal  flow.  We  introduce 
an  algorithm  for  Problem  6.2.2  which  is  based  on  iteratively  obtaining  a  (1  —  1/m)- 
optimal  flow  and  then  considering  the  problem  on  the  residual  network.  Hence,  an 
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Computing  an  e-optimal  flow: 


expected  time 

0  (m  log  e  1  (n3  log  n  -1-  mn(log 

m  log3  n  — lor  *»))) 

deterministic 

0  [m2n2  log  e_1(log  m  -1-  log2  n) 

Computing  the  optimal  solution: 


expected  time 

0  (|f|(mn3  m2n)) 

|  deterministic 

0(\t~\m2n2) 

Table  6.2:  Bounds  for  generalized  circulation 
e-optimal  flow  can  be  computed  using  0(mlog  e-1)  iterations. 

Note  that  when  e  is  a  constant,  0(m)  iterations  suffice.  The  optimal  value  can 
be  found  within  0(m\t’\)  iterations,  where  |f“|  is  the  number  of  bits  of  accuracy 
required.  Note  that  |t’|  <  mlog(n7),  where  7  is  the  largest  enumerator/ denominator 
of  a  capacity  or  a  multiplier  in  the  network. 

We  discuss  the  complexity  of  each  iteration.  In  Subsection  6.2.2  we  introduce  an 
approximation  method  that  allows  us  to  find  a  (1  —  l/m)-optimal  feasible  generalized 
flow  by  solving  a  single  UGT  instance  on  the  same  generalized  network,  where  the  ca¬ 
pacity  constraints  are  relaxed  and  costs  are  introduced.  It  follows,  that  each  iteration 
of  our  algorithm  amounts  to  solving  an  instance  of  the  restricted  UGT  problem. 

The  resulting  deterministic  and  randomized  bounds  for  computing  an  e-optimal 
generalized  circulation  are  summarized  in  Table  6.2.  Our  algorithm  is  more  practical 
than  [60].  When  the  algorithm  is  used  to  find  an  approximate  solution,  we  achieve 
strongly  polynomial  time  bounds  which  are  also  strictly  better  than  [26,  36],  and 
better  than  [60]  on  some  ranges  (e.g.,  when  the  size  of  the  binary  encoding  of  capacities 
and  multipliers  is  large).  The  algorithm  also  yields  improved  bounds  for  some  ranges 
(e.g.,  when  we  know  the  number  of  bits  in  the  binary  encoding  of  t"  is  small)  for 
obtaining  an  optimal  solution. 

Note  that  when  e  =  1  /q(m,n),  where  q  is  a  polynomial,  an  e-optimal  flow  can  be 
found  in  strongly  polynomial  time  bounds.  It  is  still  not  known  whether  a  strongly 
polynomial  time  algorithm  exists  for  finding  an  optimal  solution.  This  question  is 
of  a  particular  interest  because  generalized  circulation  is  one  of  the  simplest  classes 
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of  linear  programming  problems  for  which  no  strongly  polynomial  algorithms  are 
known  [26,  59] . 


6.2.2  Obtaining  an  approximation 

Consider  linear  programming  problems  of  the  following  form.  Given  are  matrices 
A  £  /2nxm,  U  £  Rlxn,  and  vectors  b  £  d  £  R*,  where  U  >  0,  d  >  0.  The  goal  is 
to  maximize  t,  subject  to  Ax  =  tb,  x  >0,  and  Ux  <  d.  We  refer  to  the  constraints 
Ux  <  d  as  generalized  capacity  constraints.  A  vector  *  >  0,  such  that  Ax  a  b  (Ax 
is  proportional  to  b)  and  Ux  <  d  is  called  feasible.  For  a  feasible  vector  x,  denote 
by  t(x )  the  scalar  t  such  that  Ax  =  tb.  Denote  by  t’  the  optimal  solution,  and  by 
x~  some  vector  where  t(x“)  =  t~ .  A  feasible  vector  x  is  e-optimal  if  t(x)/t~  >1  —  6. 

Suppose  that  for  0  <  c  €  iZ™  it  is  “easy”  to  compute  a  vector  x  >  0  which 
minimizes  cTx,  subject  to  Ax  =  6.  We  refer  to  problems  of  this  form  as  uncapacitated 
instances.  An  instance  of  the  original  problem  is  referred  to  as  a  capacitated  instance. 
Note  that  when  the  capacitated  problem  is  an  instance  of  generalized  circulation,  U 
is  a  diagonal  matrix  with  at  most  m  =  |JE?|  rows.  The  corresponding  uncapacitated 
problem  is  an  instance  of  UGT  on  the  same  network,  where  only  demand  nodes  are 
present. 

We  presents  an  algorithm  for  constructing  a  (l/7)-optimal  vector  y.  The  algorithm 
amounts  to  solving  a  single  instance  of  the  uncapacitated  problem.  Consider  the  cost 
function 

p(*)  =  '52ui,x/di  . 

»=1 

It  is  easy  to  verify  that  this  function  is  linear  and  has  the  following  properties: 

i.  If  x  is  feasible  then  p(x)  <  t. 

ii.  If  x  >  0,  Ax  oc  6,  and  p(x)  <  1,  then  x  is  feasible. 


iii.  If  jK*‘)  >  p(x“)  >  1  (since  for  some  i,  Ui,x"  =  *)• 
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Consider  a  vector  y  >  0,  Ay  a  6,  p(y)  =  1  which  maximizes  t(y).  Note  that  the 
vector  ly  maximizes  t{x)  for  all  vectors  x  >  0  such  that  p(x )  <  t  and  Ax  oc  b. 
In  particular  i(£y)  >  t‘,  and  hence,  i(y)  >  t~ jl.  Such  a  vector  y  can  be  obtained 
by  normalizing  a  vector  x  >  0  which  minimizes  p(x)  subject  to  Ax  =  b.  Also,  y 
is  feasible  and  therefore  provides  the  desired  approximation.  A  formal  description  of 
the  algorithm  follows. 

Algorithm  6.2.3  [Compute  a  (l/i)-optimal  vector] 

i.  Solve  the  following  instance  of  the  uncapacitated  problem:  Minimize  p(x)  sub¬ 
ject  to  x  >  0  and  Ax  =  6. 

If  it  is  infeasible,  then  stop  and  claim  that  x  =  0  is  the  only  feasible  vector  of 
the  capacitated  instance.  Otherwise,  let  x  be  the  solution. 

ii.  If  p(x)  =  0,  stop;  the  vectors  rx  are  feasible  for  all  r  >  0,  and  the  capacitated 
problem  is  unbounded. 

iii.  Otherwise,  when  p( x)  ^  0,  compute  the  largest  number  r  (must  be  bounded), 
such  that  rUx  <  d.  Claim  that  rx  is  (l//)-optimal. 

Correctness:  Consider  the  vector  x  computed  in  step  i  of  the  algorithm.  Note  that 
x  is  such  that  t(x)  =  1.  Hence,  i(rx)  =  ri(sc)  =  r. 

Proposition  6.2.4  p(x)  —  0  if  and  only  if  the  capacitated  problem  is  unbounded. 

Proof:  Suppose  that  p(x)  =  0.  Observe  that  for  all  r  >  0,  p(rx)  =  0,  and  hence, 
rx  is  feasible.  Also,  for  all  r  >  0,  t(rx)  =  r.  Hence,  the  problem  is  unbounded. 
Suppose  the  problem  is  unbounded.  There  exist  a  vector  x  such  that  rx  is  feasible 
for  all  r  >  0.  It  follows  that  p(rx)  =  rp(x)  <  £  for  all  r  >  0.  Hence,  p(x)  =  0.  § 

The  following  proposition  concludes  the  correctness  proof. 


Proposition  6.2.5  Ift ‘  is  bounded,  then  r  >  t" //. 
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Proof:  For  k  >  0,  denote 

R(k )  =  max{£(y)  \  p(y )  <  k,Ay  cx.  b.y  >  0}  .and 
Rm(k)  =  max{i(y)  j  p(y)  <  k,y  is  feasible}  . 

Obviously,  (i)  R  and  R~  are  increasing  functions,  (ii)  R  >  R",  (iii)  for  everjT  a  >  0, 
R(ak )  =  aR(k ),  and  (iv)  t~  <  R'{£)- 

The  vector  x  (computed  in  step  i)  is  such  that  i2(p(a:))  =  1  ,  and  hence  R(p(rx))  = 
r.  Since  p(x)  >  0,  Ui,rx  =  for  some  1  <  i  <  l.  Hence,  p{rx)  >1.  It  follows 
that 


r  <  R’{1)  <  R{£)  <  R(lp(rx))  =  lrR(p(x))  =  Ir  . 


I 


6.3  Bidirected  generalized  networks 


In  the  previous  sections  we  discussed  generalized  networks  where  the  flow  multipliers 
are  positive  numbers.  We  refer  to  the  edges  in  these  networks  as  tail-head  edges. 
Tail-head  edges  contribute  nonnegative  amount  of  flow  at  the  tail  end  of  the  edge 
and  a  proportional  nonpositive  amount  at  the  head  end.  In  bidirected  generalized 
networks  we  allow  two  additional  types  of  edges:  two-head  edges  and  two-tail  edges. 
The  properties  of  these  edge  types  are  shown  in  Figure  6.1.  Note  that  a  2-tail  edge 
can  be  viewed  as  a  tail-head  edge  with  a  negative  multiplier.  Bidi’-cted  generalized 
networks  are  a  generalization  of  bidirected  networks  (see  [42]).  In  biderected  networks 
the  multipliers  associated  with  the  edges  are  always  unity.  Bidirected  networks  were 
first  considered  by  Edmonds  [20]  who  related  them  to  non-bipartite  matching  theory. 
In  this  section  we  apply  the  methods  discussed  in  previous  sections  to  flow  problems 
on  bidirected  generalized  networks. 
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Flow  value:  x  Multiplier: 


-X 

ax 

X 

ax 

-. X 

• - ► 

- ax 
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a 

tail-head  edge 
2-head  edge 

2-tail  edge 


Figure  6.1:  Edge  types  of  a  bidirected  generalized  network 

6.3.1  UGT  on  bidirected  networks 

We  discuss  applying  the  Adler- Cosares  scheme  for  solving  UGT  on  bidirected  net¬ 
works.  A  bidirected  UGT  problem  has  the  form:  Minimize  cT x,  subject  to  Ax  =  6, 
x  >  0,  where  b  £  IT1,  c  €  iT”,  and  A  €  iT*xm  has  at  most  two  non-zero  entries 
per  column.  Note  that  head-tail  edges  correspond  to  columns  where  the  two  entries 
have  opposite  signs,  2-head  edges  correspond  to  columns  with  two  positive  entries, 
and  2-tail  edges  correspond  to  columns  with  two  negative  entries.  The  LP  dual  is 
the  problem  of  maximizing  bTy ,  subject  to  the  TVPI  system  ATy  <  c.  Recall  that 
when  only  head-tail  edges  are  present,  the  dual  has  monotone  constraints.  When 
head-tail  and  2-tail  edges  are  allowed,  AT  is  pre-Leontief  (see  Subsection  5.7),  and 
hence,  there  exist  a  vector  which  maximizes  all  variables.  The  Adler-Cosares  scheme 
is  applicable  when  b  >  0.  Similarly,  when  head-tail  and  2-head  edges  are  present 
—At  is  pre-Leontief,  and  hence,  there  exist  a  vector  which  minimizes  all  variables. 
The  Adler-Cosares  scheme  is  applicable  when  b  <  0.  When  all  3  types  of  edges  are 
present,  the  dual  comprises  a  general  TVPI  system.  The  algorithm  of  Chapter  5 
can  find  a  vector  which  maximizes/minimizes  a  single  variable.  The  Adler-Cosares 
scheme  is  applicable  to  problems  for  which  b  =  ±e*.  The  UGT  instances  for  which 
the  Adler-Cosares  scheme  is  applicable  are  listed  in  Table  6.3.  These  instances  can 
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Allowed  edge  types 

structure  of  the  dual  -  supply/demand  vectors 

ATy  <  c  -  TYPI  needed 

head-tail 

monotone  TYPI 

b  <  0  or  b  >  0 
monotone  (maximize  zzeTy) 

head-tail,  2-tail 

A1  is  pre-Leontief 

b  >  0 

monotone  (maximize  eTy ) 

head-tail,  2-head 

—A1  is  pre-Leontief 

6  <  0 

monotone  (minimize  eTy ) 

head-tail,  2-tail,  2-head 

general  TYPI 

bj  =  ±1,  bi  =  0  (i  ^  j) 
general  (maximize/minimize  yj ) 

Table  6.3:  Solving  UGT  on  bidirected  generalized  networks 


be  solved  using  a  single  application  of  the  algorithms  of  Chapter  5. 


6.3.2  Generalized  circulation  on  bidirected  networks 

We  consider  applying  the  approximation  algorithm  of  Section  6.2  to  generalized  cir¬ 
culation  problem  where  the  underlying  network  is  bidirected.  Recall  that  the  approx¬ 
imation  algorithm  iteratively  computes  a  feasible  flow  and  in  the  following  iteration 
considers  the  residual  graph.  Since  2-tail  edges  give  rise  to  2-head  edges  in  the  resid¬ 
ual  graph  (and  vice  versa),  we  only  consider  networks  where  all  three  edge  types  are 
present.  Note  that  when  all  three  edge  types  are  present  the* Adler- Cosares  scheme 
applies  to  UGT  instances  where  there  is  a  single  source  or  a  single  sink  (that  is,  b  =  e* 
or  6  =  —  e*  for  some  1  <  i  <  n). 

It  follows  that  the  approximation  scheme  presented  in  Section  6.2  can  be  used  to 
solve  bidirected  generalized  circulation  instances  where  6  =  ±ej  for  some  1  <  i  <  n. 


6.4  Concluding  remarks 


In  this  chapter  we  presented  algorithms  for  the  uncapacitated  generalized  transship¬ 
ment  (UGT)  problem  and  the  generalized  circulation  (GC)  problem.  We  also  consid¬ 
ered  the  UGT  and  GC  problems  on  bidirected  generalized  networks.  To  solve  UGT, 


6.4.  CON  CL  UDING  REMARKS 


149 


we  combined  a  scheme  by  Adler  and  Cosares  lj,  which  reduces  the  restricted  UGT 
problem  where  either  only  demand  nodes  or  only  supply  nodes  are  present  to  solving 
the  LP  dual,  with  the  algorithms  given  in  Chapter  5.  The  combination  yielded  better 
time  bounds  for  restricted  UGT  instances. 

In  order  to  utilize  the  UGT  algorithms  for  solving  the  capacitated  GC  problem, 
we  introduced  an  iterative  approximation  algorithm.  In  each  iteration,  we  consider 
a  UGT  instance,  with  costs  which  “capture”  the  relaxed  capacities.  The  solution  of 
this  UGT  instance  yields  an  approximate  solution  for  the  GC  instance.  The  next 
iteration  considers  the  residual  graph. 

We  comment  on  the  parallel  running  times  of  the  algorithms  mentioned  above. 
The  parallel  complexity  of  the  algorithms  of  Chapter  5  is  0{n)  using  0(mn )  processors 
for  the  deterministic  bound  and  0(m  +  n2)  processors  for  the  randomized  bound. 
The  algorithms  for  the  restricted  UGT  instances  have  the  same  complexity.  The 
approximation  algorithm  for  GC  runs  in  0(mn  log  e-1)  time  using  0(mn )  processors 
for  the  deterministic  bound  and  0(m  -f  n2)  processors  for  the  randomized  bound. 

A  problem  which  remains  open  regards  the  existence  of  a  strongly  polynomial  for 
the  unrestricted  UGT  problem  (where  many  sources  and  sinks  are  allowed).  The  LP 
dual  of  UGT  is  a  monotone  system  with  a  general  objective  function;  what  brings  as 
back  to  an  open  question  from  Chapter  5.  The  following  reduction  (similar  to  [52]) 
demonstrates  the  difficulty  of  the  problem.  An  instance  of  capacitated  generalized 
transshipment  can  be  reduced  in  linear  time  to  an  instance  of  UGT  with  3m  edges 
and  n  +  2m  nodes.  The  reduction  is  as  follows.  Consider  an  instance  of  generalized 
circulation  on  a  network  G  =  ( V ,  E)  where  d*  £  R  is  the  supply /demand  at  v  (v  € 
U),  and  ae,ue,Ct  are  the  multiplier,  capacity,  and  cost,  respectively,  of  the  edge  e 
(e  €  E).  The  corresponding  UGT  instance  G'  =  (V  U  W  U  W',E')  preserves  the 
supplies  and  demands  at  the  nodes  V.  Each  edge  e  €  E  has  corresponding  two  nodes 
we  £  W, w'e  €  W'  and  three  edges  in  E'  which  form  an  undirected  path.  The  node 
wt  has  demand  ue  and  w'e  has  supply  —ut.  Suppose  e  =  (wi,r2),  the  corresponding 
edges  are  (i)  (vi,tu)  with  multiplier  1  and  cost  Ce,  (ii)  (w',w)  with  multiplier  1  and 
cost  0,  and  (iii)  (tu',v2)  with  multiplier  at  and  cost  0. 
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Chapter  7 
Conclusion 


In  Chapter  2  we  presented  an  NC  and  strongly  polynomial  algorithm  to  detect  cycles 
in  fixed  dimensional  periodic  graphs.  This  algorithm  is  based  on  a  strongly  polyno¬ 
mial  algorithm  given  in  Chapter  3  for  the  parametric  minimum  cycle  problem  with 
fixed  number  of  parameters.  In  Chapter  4  we  presented  the  algorithm  of  Chapter  3 
as  a  general  tool  for  achieving  strongly  polynomial  time  bounds.  In  Chapter  5  we 
gave  faster  algorithms  for  linear  systems  of  inequalities,  where  at  most  two  variables 
appear  in  each  inequality.  In  Chapter  6  we  introduced  algorithms  for  some  gener¬ 
alized  network  flow  problems  which  utilize  the  results  of  Chapter  5.  In  particular 
we  obtained  a  faster  algorithm  for  approximating  the  optimal  generalized  circulation, 
which  is  also  the  first  strongly  polynomial  algorithm  for  the  problem.  The  existence  of 
this  approximation  algorithm  is  particularly  interesting  since  it  is  still  open  whether 
there  exist  a  strongly  polynomial  algorithm  for  computing  the  optimal  solution. 

These  results  are  interesting  in  and  of  themselves,  but  our  work  is  a  step  towards 
attacking  an  important  open  problem  of  the  field:  finding  a  strongly  polynomial 
algorithm  for  general  linear  programming. 
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